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TITLE 

A METHOD FOR HIGH-DENSITY MICROARRAY MEDIATED 

GENE EXPRESSION PROFILING 
This application claims the benefit of U.S. Provisional Application 
5 No. 60/159,898, filed October 15, 1999. 

FIELD OF THE INVENTION 
This invention is in the field of bacterial gene expression. More 
specifically, this invention is a method for the high density, microarray-mediated 
gene expression profiling of Escherichia coli for comprehensive gene expression 
10 analysis. 

BACKGROUND OF THE INVENTION 
Escherichia colt has been exhaustively studied for over 50 years. Early 
experiments measured the molecular fluxes from small compounds into 
macromolecular constituents. These studies were followed by others in which 

15 small molecule pools of central metabolic building blocks, nucleotides and amino 
acids were determined. The levels of several macromolecular components, 
including individual species of proteins, have been measured. Such measurements 
of the steady state provide a census of the cellular content while changes upon 
imposition of a stress catalogue the cell's fight for survival. This response to an 

20 insulting or adverse condition can take many forms from relieving end product 
inhibition to derepressing transcription. 

In & coli experiments to define stress-related, global regulatory responses 
have often relied upon one of two approaches. In the first, operon fusions induced 
by a particular stress are isolated. In the second, proteomic measures in which the 

25 protein.fractions from stressed and un-stressed cultures are separated by a 
two-dimensional method and then compared. Each method has an inherent 
technological hurdle; for the former, the map location of responsive gene fusions 
must be known precisely, and for the latter, induced or repressed proteins excised 
from the two-dimensional gels must be identified. 

30 Another method uses a transposon-mediated mutagenesis (Spector et al. J. 

Bacteriol 170:345-351 (1988)). A reporter gene is inserted at a random location 
in the genome using a transposon. By assaying for the reporter gene before and 
after the treatment, genes affected by the treatment can be mapped and cloned by 
using the linked transposon as a marker. However, this method is limited to 

35 non-essential genes. 

Alternatively, mRNA measurements utilizing techniques (such as 
hybridization to DNA and primer extension) have allowed the monitoring of 



1 



WO 01/29261 PCT7US00/28352 



individual gene's expression profiles. DeRisi et al. (Science 278:680-686 (1997)) 
reported the expression profiling of most yeast genes. The measurements were 
facilitated by high-density arrays of individual genes and specific labeling of 
cDNA copies of eukaryotic mRNA using polyA tail-specific primers. The lack of 
5 a polyA tail and the extremely short bacterial mRNA half life represent hurdles for 
the application of DNA micro-array technology to prokaryotic research. 

A comprehensive expression profiling has been performed previously with 
the yeast Saccharomyces cerevisiae. Adaptation of RNA isolation and labeling 
protocols from eukaryotes to prokaryotes is not straightforward since eukaryotic 
10 mRNA manipulations often exploit 3'-polyadenylation of this molecular species. 

Chuang et al. (/. Bacterol 175:2026-2036 (1993)) reported an expression 
profiling using large DNA fragments from an ordered X library of E. coli genomic 
fragments as a capture reagent. It allowed the comparison of the expression 
patterns from large portions of DNA fragments by comparing mRNA levels from 
15 stressed and unstressed E. coli cultures. The resolution of this method, however, 
was unsatisfactory. Expression of groups of genes, as opposed to the expression 
of each individual gene was measured. Moreover, the method used radio-labeled 
DNA as a probe with the incumbent need for safety precautions. Furthermore, the 
use of radio-labeled probe prevents the simultaneous measurement of the 
20 expression level in a test sample and a control sample. 

Richmond et al. (Nucleic Acids Research, 19:3821-3835 (1999)) has 
recently reported genome-wide expression profiling of E. coli at a single ORF 
level of resolution. Changes in RNA levels after exposure to heat shock or IPTG 
were analyzed using comprehensive low density blots of individual ORFs on a 
25 nylon matrix and comprehensive high density arrays of individual ORFs spotted 
on glass slides. The results of the two methods were compared. 

The methods recited above permit monitoring of the effect of 
environmental changes on gene expression by comparing expression levels of a 
limited number of genes. They, however, fail to monitor the comprehensive 
30 responses of a preponderance of individual genes in the genome of an organism in 
reliable, useful manner. 

The problem to be solved, therefore, is to provide a way to measure the 
comprehensive gene expression profile analysis of the organism. 

SUMMARY OF THE INVENTION 
35 The invention provides a method for identifying gene expression changes 
within a bacterial species comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
comprised in a bacterial species; 
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(b) generating a first set of labeled probes from bacterial RNA, the 
RNA isolated from the bacterial species of step (a); 

(c) hybridizing the first set of labeled probes of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 

5 results in a detectable signal generated from the labeled probe; 

i (d) measuring the signal generated by the hybridization of the first set 

of labeled probe to the comprehensive micro-array of step (c); 

(e) subjecting the bacterial species of step (a) to a gene expression 
altering condition whereby the gene expression profile of the 

10 bacterial species is altered to produce a modified bacterial species ; 

(f) generating a second set of labeled probes from bacterial RNA, the 
RNA isolated from the modified bacterial species of step (e); 

(g) hybridizing the second set of labeled probes of step (f) to the 
comprehensive micro-array of step (a), wherein hybridization 

15 results in a detectable signal generated from the labeled probe; 

(h) measuring the signal generated by the hybridization of the second 
set of labeled probes to the comprehensive micro-array of step (g); 
and 

(i) comparing signal generated from the first hybridization to the 
20 signal generated from the second hybridization to identify gene 

expression changes within a bacterial species. 
Additionally the invention provides a method for identifying gene 
expression changes within a bacterial strain comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
25 comprised in a bacterial species ; 

(b) generating a first set of fluorescent cDNA from bacterial RNA, the 
RNA isolated from the bacterial species of step (a); 

(c) hybridizing the first set of fluorescent cDNA of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 

30 results in a detectable signal generated from the fluorescent cDNA; 

(d) measuring the signal generated by the hybridization of the first set 
of fluorescent cDNA to the comprehensive micro-array of step (c); 

1 (e) subjecting the bacterial species of step (a) to a gene expression 
altering condition whereby the gene expression profile of the 
35 bacterial species is altered to produce a modified bacterial species; 

(f) generating a second set of fluorescent cDNA from bacterial RNA, 
the RNA isolated from the modified bacterial species of step (e); 
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(g) hybridizing the second set of fluorescent cDNA of step (f) to the 
comprehensive micro-array of step (a), wherein hybridization 
results in a detectable signal generated from the fluorescent cDNA; 

(h) measuring the signal generated by the hybridization of the second 
5 set of fluorescent cDNA to the comprehensive micro-array of 

step (g); and 

(i) comparing signal generated from the first hybridization to the 
signal generated from the second hybridization to identify gene 
expression changes within a bacterial species . 

10 In an alternate embodiment the invention provides a method for 

identifying gene expression changes within a genome comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
comprised in a prokaryotic or eukaryotic speices; 

(b) generating a control set of fluorescent cDNA from total or 
15 polyadenylated RNA, the RNA isolated from the species of 

step (a), the fluorescent cDNA comprising at least one first 
fluorescent label and at least one different second fluorescent label; 

(c) mixing the control set of fluorescent cDNA labeled with the at least 
one first label with the control set of fluorescent cDNA labeled 

20 with the at least second first label to for a dual labeled control 

cDNA; 

(d) hybridizing the dual labeled control set of fluorescent cDNA of 
step (c) to the comprehensive micro-array of step (a), wherein 
hybridization results in a detectable signal generated from the 

25 fluorescent cDNA; 

(e) measuring the signal generated by the hybridization of the dual 
labeled control set of fluorescent cDNA to the comprehensive 
micro-array of step (c); 

(f) subjecting the prokaryote or eukaryote of step (a) to a gene 

30 expression altering condition whereby the gene expression profile 

of the prokaryote or eukaryote is altered to produce a modified 
prokaryote or eukaryote ; 

(g) generating an experimental set of fluorescent cDNA from total or 
polyadenylated RNA, the RNA isolated from the modified 

35 prokaryote or eukaryote of step (e), the fluorescent cDNA 

comprising the first fluorescent label and the different second 
fluorescent label to step (b); 
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(h) mixing the experimental set of fluorescent cDNA labeled with the 
at least one first label with the experimental set of fluorescent 
cDNA labeled with the at least second first label to form a dual 
labeled experimental cDNA; 
5 (i) hybridizing the experimental set of fluorescent cDNA of step (h) to 

the comprehensive micro-array of step (a), wherein hybridization 
results in a detectable signal generated from the fluorescent cDNA; 

(j) measuring the signal generated by the hybridization of the second 
set of fluorescent cDNA to the comprehensive micro-array of 
10 step (g); and 

(k) comparing signal generated from the dual labeled control 

hybridization with the dual labeled experimental hybridization to 
identify gene expression changes within a prokaryotic or 
eukaryotic species. 

15 In another embodiment the invention provides a method for quantitating the 

amount of protein specifying RNA contained within a genome comprising: 
(a) providing a comprehensive micro-array comprising a multiplicity 
of genes synthesized from genomic DNA comprised in a 
prokaryotic or eukaryotic organism; 
20 (b) generating a set of fluorescent cDNA from total or poly-adenylated 

RNA isolated from the prokaryotic or eukaryotic organism of 
step (a); 

(c) generating a set of fluorescent DNA from genomic DNA isolated 
from the prokaryotic or eukaryotic organism of step (a); 
25 (d) hybridizing the fluorescent cDNA of step (b) to the comprehensive 

micro-array of step (a), wherein hybridization results in a first 
fluorescent signal generated from the fluorescent cDNA for each 
gene; 

(e) hybridizing the fluorescent DNA of step (c) to the comprehensive 
30 micro-array of step (a), wherein hybridization results in a second 

fluorescent signal generated from the fluorescent DNA for each 
gene; and 

(f) dividing, for each open reading from, the first fluorescent signal 

t into the second fluorescent signal to provide a quantitated measure 

35 of the amount of protein specifying RNA for each gene. 

The methods of the present invention are applicable to genomes contained 
within a variety of organisms including bacteria, cyanobacteria, yeasts, 
filamentous fungi, plant cells and animal cells. 

5 
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The present methods of identifying gene expression changes within 
genome may be additionally coupled with the methods of quantitating the amount 
of protein specifying RNA contained within a genome as disclosed herein. 
BRIEF DESCRIPTION OF THE DRAWINGS AND 
5 SEQUENCE DESCRIPTIONS. 

Figure 1 A describes the gene expression analysis of IPTG induction in a 
single hybridization experiment using different slide sets as capture reagents for 
Cy3-labeled cDNA derived from treated and control cells and plotted in log-log 
form. 

10 Figure IB describes the gene expression analysis of IPTG induction by 

labeling the control sample with Cy5 and the induced sample with Cy3 before 
hybridizing to a single set of 3 slides. 

Figure 1C describes an average of induced RNA and control RNA with 
Cy3 from IPTG induction, generated by label swapping. 
15 Figure ID describes data replicating the results shown in Figure 1 C. 

Figure IE describes an averaging of the data of Figure 1C and Figure ID. 
Figure 2 describes the distribution of gene expression levels for cells 
grown in minimal or rich medium. 

Figure 3 describes the fractional (summed open reading frame 
20 transcripts/total open reading frame transcripts) analysis of gene expression. 

The invention can be more fully understood from the following detailed 
description and the accompanying sequence descriptions which form a part of this 
application. 

The following sequences comply with 37 C.F.R. 1.821-1.825 
25 ("Requirements for Patent Applications Containing Nucleotide Sequences and/or 
Amino Acid Sequence Disclosures - the Sequence Rules") and are consistent with 
World Intellectual Property Organization (WIPO) Standard ST.25 (1998) and the 
sequence listing requirements of the EPO and PCT (Rules 5.2 and 49.5(a-bis), and 
Section 208 and Annex C of the Administrative Instructions). The symbols and 
30 format used for nucleotide and amino acid sequence data comply with the rules set 
forth in 37 C.F.R. §1.822. 

SEQ ID NO:l and 2 are primers used in the amplification of the sdiA gene. 

DETAILED DESCRIPTION OF THE INVENTION 
Applicants have solved the stated problem by providing a method to 
35 measure a comprehensive mRNA expression of E. coli using a high density DNA 
microarray with a near-complete collection of E. coli open reading frames (ORFs). 
The present invention advances the art by providing: 
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(i) the first instance of a comprehensive micro-array comprising greater 
than 75% of all open reading frames from a prokaryotic organism, overcoming the 
problems of high concentration of endogenous RNAase and ribosomal RNA; 

(ii) a method for quantitating the amount of each protein specifying RNA 
5 contained within a culture; and 

(iii) a method for decreasing the background noise generated within a gene 
expression profile through the combination of multiple signal generating labels. 

The present invention has utility in many different fields. Many discovery 
compounds can be screened by comparing their gene expression profile to a 
10 known compound that affects the desirable target gene products. Additionally 
gene expression profiles are good indicators of genotypic alterations among 
strains. The present invention may allow the discovery of complementary target 
inhibitors in combination drug-therapy and may be used as a modeling system to 
test perturbations in process conditions to determine the conditions for the high 
15 yield of desired production in various bio-processes and biotransformations. 

In this disclosure, a number of terms and abbreviations are used. The 
following definitions are provided. 

"Open reading frame" is abbreviated ORR The term "ORF" is refers to a 
gene that specifies a protein. 
20 "Polymerase chain reaction" is abbreviated PGR. 

The term "micro-array" means an array of regions having a density of 
discrete regions of oligonucleotides of at least about 100/cm 2 , and preferably at 
least about 1000/cm 2 . 

The term "comprehensive micro array" refers to high-density micro-array 
25 containing at least 75% of all open reading frames of the organism. 

The term "expression profile" refers to the expression of groups of genes. 

The term "gene expression profile" refers to the expression of an 
individual gene and of suites of individual genes. 

The "comprehensive expression profile" refers to the gene expression 
30 profile of more than 75% of all genes in the genome. 

The term "high density" as used in conjunction with micro-array means 
and array having an array density of generally greater than about 60, more 
generally greater than about 100, most generally greater than about 600, often 
greater than about 1000, more often greater than about 5,000, most often greater 
35 than about 10,000, preferably greater than about 40,000 more preferably greater 
than about 100,000, and most preferably greater than about 400,000 different 
nucleic acids per cm. 2 
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As used herein, an "isolated nucleic acid fragment" is a polymer of RNA 
or DNA that is single- or double-stranded, optionally containing synthetic, non- 
natural or altered nucleotide bases. An isolated nucleic acid fragment in the form 
of a polymer of DNA may be comprised of one or more segments of cDNA, 
5 genomic DNA or synthetic DNA. 

The term "probe" refers to a single-stranded nucleic acid molecule that can 
base pair with a complementary single stranded target nucleic acid to form a 
double-stranded molecule. 

The term "genotype" refers to the genetic constitution of an organism as 
10 distinguished from its physical appearance. 

The term "genomic DNA" refers to total DNA from an organism. 
The term "total RNA" refers to non-fractionated RNA from an organism. 
The term "protein specifying RNA" or "protein specifying transcript" or 
"mRNA" refers to RNA derived from ORF. 
15 The term "label" will refer to a substance which may be incorporated into 

DNA or RNA which will emit a detectable signal under various conditions. 
Typically a label will be a fluorescent moiety. 

A nucleic acid molecule is "hybridizable" to another nucleic acid 
molecule, such as a cDNA, genomic DNA, or RNA, when a single stranded form 
20 of the nucleic acid molecule can anneal to the other nucleic acid molecule under 
the appropriate conditions of temperature and solution ionic strength. 
Hybridization and washing conditions are well known and exemplified in 
Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory 
Manual Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
25 Harbor (1 989), particularly Chapter 1 1 and Table 11.1 therein. The conditions of 
temperature and ionic strength determine the "stringency" of the hybridization. 
Hybridization requires that the two nucleic acids contain complementary 
sequences, although depending on the stringency of the hybridization, mismatches 
between bases are possible. The appropriate stringency for hybridizing nucleic 
30 acids depends on the length of the nucleic acids and the degree of 

complementation, variables well known in the art. The greater the degree of 
similarity or homology between two nucleotide sequences, the greater the value of 
Tm for hybrids of nucleic acids having those sequences. The relative stability 
(corresponding to higher Tm) of nucleic acid hybridizations decreases in the 
35 following order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater 
than 100 nucleotides in length, equations for calculating Tm have been derived 
(see Sambrook et al., supra, 9.50-9.5 1). For hybridizations with shorter nucleic 
acids, i.e., oligonucleotides, the position of mismatches becomes more important, 
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and the length of the oligonucleotide determines its specificity (see Sambrook 
et al., supra, 1 1 .7-1 1.8). Furthermore, the skilled artisan will recognize that the 
temperature and wash solution salt concentration may be adjusted as necessary 
according to factors such as length of the probe. 
5 The term "complementary" is used to describe the relationship between 

nucleotide bases that are capable to hybridizing to one another. For example, with 
respect to DNA, adenosine is complementary to thymine and cytosine is 
complementary to guanine. 

"Gene" refers to the part of the genome specifying a macromolecular 
10 product be it RNA or a protein and include regulatory sequences preceding 
(5* non-coding sequences) and following (3* non-coding sequences) the coding 
sequence. 

A "genetic site" refers to a genomic region at which a gene product 
operates. 

15 "Coding sequence" or "open reading frame" (ORF) refers to a DNA 

sequence that codes for a specific amino acid sequence. "Suitable regulatory 
sequences" refer to nucleotide sequences located upstream (5 1 non-coding 
sequences), within, or downstream (3' non-coding sequences) of a coding 
sequence, and which influence the transcription, RNA processing or stability, or 

20 translation of the associated coding sequence. Regulatory sequences may include 
promoters, translation leader sequences, introns, and polyadenylation recognition 
sequences. 

"Promoter" refers to a DNA sequence capable of controlling the 
expression of a coding sequence or functional RNA. In general, a coding 

25 sequence is located 3* to a promoter sequence. Promoters may be derived in their 
entirety from a native gene, or be composed of different elements derived from 
different promoters found in nature, or even comprise synthetic DNA segments. It 
; is understood by those skilled in the art that different promoters may direct the 
expression of a gene in different tissues or cell types, or at different stages of 

30 development, or in response to different environmental conditions. Promoters 

which cause a gene to be expressed in most cell types at most times are commonly 
referred to as "constitutive promoters". It is further recognized that since in most 
cases the exact boundaries of regulatory sequences have not been completely 
defined, DNA fragments of different lengths may have identical promoter activity. 

35 "RNA transcript" refers to the product resulting from RNA polymerase- 

catalyzed transcription of a DNA sequence. When the RNA transcript is the 
polymer product of an RNA polymerase, it is referred to as the primary transcript 
or it may be a RNA sequence derived from post-transcriptional processing of the 
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primary transcript and is referred to as the mature RNA. "Messenger RNA 
(mRNA)" refers to the RNA that is without introns and that can be translated into 
protein by the cell. "cDNA" refers to a double-stranded DNA that is 
complementary to and derived from mRNA. 
5 The term "expression", as used herein, refers to the transcription and stable 

accumulation of sense (mRNA) or antisense RNA derived from genomic DNA. 
Expression may also refer to translation of mRNA into a polypeptide. 

The term "stress" or "environmental stress" refers to the condition 
produced in a cell as the result of exposure to an environmental insult. 

10 The term "insult" or "environmental insult" refers to any substance or 

environmental change that results in an alteration of normal cellular metabolism in 
a bacterial cell or population of cells. Environmental insults may include, but are 
not limited to, chemicals, environmental pollutants, heavy metals, changes in 
temperature, changes in pH, as well as agents producing oxidative damage, DNA 

15 damage, anaerobiosis, and changes in nitrate availability or pathogenesis. 

The term "stress response" refers to the cellular response to an 
environmental insult. 

The term "stress gene" refers to any gene whose transcription is induced as 
a result of environmental stress or by the presence of an environmental insult. 

20 The term "modified bacterial species" refers to a bacterial culture that has 

been exposed to a stress or insult such that either it demonstrates a change in its 
gene expression profile. Typically the modified bacterial species is produced as 
the result of induction or challenge of the culture with a chemical or 
environmental challenge. Similarly, a "modified prokaryotic or eukaryotic 

25 ' species" refers to either a prokarytoic or eukaryotic organism that has been 

exposed to a stress or insult such that the gene expression profile of that organisms 
as been altered. 

The term "log phase", "log phase growth", "exponential phase" or 
"exponential phase growth" refers to cell cultures of organisms growing under 
30 conditions permitting the exponential multiplication of the cell number. 

The term "growth-altering environment" refers to energy, chemicals, or 
living things that have the capacity to either inhibit cell growth or kill cells. 
Inhibitory agents may include but are not limited to mutagens, antibiotics, UV 
light, gamma-rays, x-rays, extreme temperature, phage, macrophages, organic 
35 chemicals and inorganic chemicals. 

Standard recombinant DNA and molecular cloning techniques used here 
are well known in the art and are described by Sambrook, J., Fritsch, E. F. and 
Maniatis, T., Molecular Cloning: A Laboratory Manual, Second Edition, Cold 

10 
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Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989) (hereinafter 
"Maniatis"); and by Silhavy, T. J., Bennan, M. L. and Enquist, L. W., 
Experiments with Gene Fusions , Cold Spring Harbor Laboratory Cold Press 
Spring Harbor, NY (1984); and by Ausubel, F. M. et al., Current Protocols in 
5 Molecular Biology, published by Greene Publishing Assoc. and Wiley- 
Interscience (1987). 

The present invention provides a method to measure the changes in gene 
expression profiles of prokaryotic organisms. The present invention also provides 
a method to measure the levels of protein specifying RNA in prokaryotic and/or 

10 eukaryotic organisms. The present invention provides a method to compare the 
gene expression patterns of two samples differing in one variable. The variables 
may include but are not limited to genotype, media, temperature, depletion or 
addition of nutrient, addition of an inhibitor, physical assault, irradiation, heat, 
cold, elevated or lowered pressure, desiccation, low or high ionic strength, and 

15 growth phases. 

Gene expression profiles were determined under the following conditions 
to find: (a) differences in gene expression profiles caused by growth of E. coli in 
either minimal or rich medium, (b) changes in gene expression associated with the 
transition from exponential phase to stationary phase growth in minimal medium, 

20 and (c) the specificity of induction mediated by isopropylthiogalactoside (IPTG), 
the classic lac operon inducer, (d) the specificity of expression changes mediated 
by the amplification of sdiA, a positive activator of an operon that includes 
ftsQAZ, genes essential for septation, and (e) the changes in gene expression 
patterns with cells that cannot turn on the SOS stress response in comparison to 

25 wild type response when the cells are exposed to mitomycin C (MMC). 

In its most basic form the present invention creates a comprehensive 
micro-array from a bacterial genome. Any bacteria is suitable for analysis by the 
method of the present invention where enteric bacteria (Escherichia, and 
Salmonella for example) as well as cyanobacteria (such as Rhodobacter and 

30 Synechocystis and Bacillus, Acinetobacter, Streptomyces, Methylobacter, and 
Pseudomona are particularly suitable. 

One of skill in the art will appreciate that in order to measure the 
transcription level (and thereby the expression level) of a gene or genes, it is 
desirable to provide a nucleic acid sample comprising mRNA transcript(s) of the 

35 gene or genes, or nucleic acids derived from the mRNA transcript(s). As used 

herein, a nucleic acid derived from an mRNA transcript refers to a nucleic acid for 
whose synthesis the mRNA transcript or a subsequence thereof has ultimately 
served as a template. Thus, a cDNA reverse transcribed from an mRNA, an RNA 

11 
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transcribed from that cDNA, a DNA amplified from the cDNA, an RNA 
transcribed from the amplified DNA, etc., are all derived from the mRNA 
transcript and detection of such derived products is indicative of the presence 
and/or abundance of the original transcript in a sample. Thus, suitable samples 
5 include, but are not limited to, mRNA transcripts of the gene or genes, cDNA 
reverse transcribed from the mRNA, cRNA transcribed from the cDNA, DNA 
amplified from the genes, RNA transcribed from amplified DNA, and the like. 

Typically the genes are amplified by methods of primer directed 
amplification such as polymerase chain reaction (PCR) (U.S. Patent 

10 No. 4,683,202 (1987, Mullis, et al.) and U.S. Patent No. 4,683,195 (1986, Mullis, 
et al.), ligase chain reaction ( LCR) (Tabor et al., Proc. Acad. ScL U.S. A, 82, 
1074-1078 (1985)) or strand displacement amplification (Walker et al., Proc. Natl 
Acad. ScL U.S.A., 89, 392, (1992) for example. 

The micro-array is comprehensive in that it incorporates at least 75% of all 

15 ORF's present in the genome. Amplified ORFs are then spotted on slides 

comprised of glass or some other solid substrate by methods well known in the art 
to form a micro-array. Methods of forming high density arrays of 
oligonucleotides, with a minimal number of synthetic steps are known (see for 
example Brown et al., U.S. Patent No. 6,1 10,426). The oligonucleotide analogue 

20 array can be synthesized on a solid substrate by a variety of methods, including, 
but not limited to, light-directed chemical coupling, and mechanically directed 
coupling. See Pirrung et al., U.S. Pat. No. 5,143,854 (see also PCT Application 
No. WO 90/15070) and Fodor et al., PCT Publication Nos. WO 92/10092 and 
WO 93/09668 which disclose methods of forming vast arrays of peptides, 

25 oligonucleotides and other molecules using, for example, light-directed synthesis 
techniques. See also, Fodor et al., Science, 251, 767-77 (1991). 

Bacteria typically contain from about 2000 to about 6000 ORF's per 
genome and the present method is suitable for genomes of this size where 
genomes of about 4000 ORF's are most suitable. The ORF's are arrayed in high 

30 density on at least one glass microscope slide. This is in contrast to a low density 
array where ORF's are arrayed on a membranous material such as nitrocellulose. 
The small surface area of the high density array (often less than about 1 0 cm 2 , 
preferably less than about 5 cm 2 more preferably less than about 2 cm 2 , and most 
preferably less than about 1.6 cm. 2 ) permits extremely uniform hybridization 

35 conditions (temperature regulation, salt content, etc.). 

Once all the genes of ORF's from the genome are amplified, isolated and 
arrayed, a set of probes, bearing a signal generating label are synthesized. Probes 
may be randomly generated or may be synthesized based on the sequence of 
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specific open reading frames. Probes of the present invention are typically single 
stranded nucleic acid sequences which are complementary to the nucleic acid 
sequences to be detected. Probes are "hybridizable" to the ORF's. The probe 
length can vary from 5 bases to tens of thousands of bases, and will depend upon 
5 the specific test to be done. Typically a probe length of about 1 5 bases to about 
30 bases is suitable. Only part of the probe molecule need be complementary to 
the nucleic acid sequence to be detected. In addition, the complementarity 
between the probe and the target sequence need not be perfect. Hybridization 
does occur between imperfectly complementary molecules with the result that a 

10 certain fraction of the bases in the hybridized region are not paired with the proper 
complementary base. 

Signal generating labels that may be incorporated into the probes are well 
known in the art. For example labels may include but are not limited to 
fluorescent moieties, chemiluminescent moieties, particles, enzymes, radioactive 

15 tags, or light emitting moieties or molecules, where fluorescent moieties are 
preferred. Most preferred are fluorescent dyes capable of attaching to nucleic 
acids and emitting a fluorescent signal. A variety of dyes are known in the art 
such as fluorescein, Texas red, and rhodamine. Preferred in the present invention 
are the mono reactive dyes cy3 (146368-16-3) and cy5 (146368-14-1) both 

20 available commercially (i.e.Amersham Pharmacia Biotech, Arlington Heights, 
IL). Suitable dyes are discussed in U.S. Patent No. 5,814,454 hereby incorporated 
by reference. 

Labels may be incorporated by any of a number of means well known to 
those of skill in the art. However, in a preferred embodiment, the label is 

25 simultaneously incorporated during the amplification step in the preparation of the 
probe nucleic acids. Thus, for example, polymerase chain reaction (PCR) with 
labeled primers or labeled nucleotides will provide a labeled amplification 
product. In a preferred embodiment, reverse transcription or replication, using a 
labeled nucleotide (e.g. dye-labeled UTP and/or CTP) incorporates a label into the 

30 transcribed nucleic acids. 

Alternatively, a label may be added directly to the original nucleic acid 
sample (e.g., mRNA, polyA rnRNA, cDNA, etc.) or to the amplification product 
after the synthesis is completed. Means of attaching labels to nucleic acids are 
well known to those of skill in the art and include, for example nick translation or 

35 end-labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and 
subsequent attachment (ligation) of a nucleic acid linker joining the sample 
nucleic acid to a label (e.g., a fluorophore). 
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Following incorporation of the label into the probe the probes are then 
hybridized to the micro-array using standard conditions where hybridization 
results in a double stranded nucleic acid, generating a detectable signal from the 
label at the site of capture reagent attachment to the surface. Typically the probe 
5 and array must be mixed with each other under conditions which will permit 
nucleic acid hybridization. This involves contacting the probe and array in the 
presence of an inorganic or organic salt under the proper concentration and 
temperature conditions. The probe and array nucleic acids must be in contact for a 
long enough time that any possible hybridization between the probe and sample 

10 nucleic acid may occur. The concentration of probe or array in the mixture will 
determine the time necessary for hybridization to occur. The higher the probe or 
array concentration the shorter the hybridization incubation time needed. 
Optionally a chaotropic agent may be added. The chaotropic agent stabilizes 
nucleic acids by inhibiting nuclease activity. Furthermore, the chaotropic agent 

15 allows sensitive and stringent hybridization of short oligonucleotide probes at 
room temperature [Van Ness and Chen (1991) Nucl Acids Res. 19:5143-5151]. 
Suitable chaotropic agents include guanidinium chloride, guanidinium 
thiocyanate, sodium thiocyanate, lithium tetrachloroacetate, sodium perchlorate, 
rubidium tetrachloroacetate, potassium iodide, and cesium trifluoroacetate, among 

20 others. Typically, the chaotropic agent will be present at a final concentration of 
about 3 M. If desired, one can add formamide to the hybridization mixture, 
typically 30-50% (v/v). 

Various hybridization solutions can be employed. Typically, these 
comprise from about 20 to 60% volume, preferably 30%, of a polar organic 

25 solvent. A common hybridization solution employs about 30-50% v/v 

formamide, about 0.15 to 1 M sodium chloride, about 0.05 to 0.1 M buffers, 
such as sodium citrate, Tris-HCl, PIPES or HEPES (pH range about 6-9), about 
0.05 to 0.2% detergent, such as sodium dodecylsulfate, or between 0.5-20 mM 
EDTA, FICOLL (Pharmacia Inc.) (about 300-500 kilodaltons), 

30 polyvinylpyrrolidone (about 250-500 kdal), and serum albumin. Also included 
in the typical hybridization solution will be unlabeled carrier nucleic acids from 
about 0.1 to 5 mg/mL, fragmented nucleic DNA, e.g., calf thymus or salmon 
sperm DNA, or yeast RNA, and optionally from about 0.5 to 2% wt./vol. 
glycine. Other additives may also be included, such as volume exclusion agents 

35 which include a variety of polar water-soluble or swellable agents, such as 
polyethylene glycol, anionic polymers such as polyacrylate or 
polymethylacrylate, and anionic saccharidic polymers, such as dextran sulfate. 
Methods of optimizing hybridization conditions are well known to those of skill 
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in the art (see, e.g., Laboratory Techniques in Biochemistry and Molecular 
Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed. 
Elsevier, N.Y., (1993)) and Maniatis, supra. 

The basis of gene expression profiling via micro-array technology relies on 
5 comparing an organism under a variety of conditions that result in alteration of the 
genes expressed. Within the context of the present invention a single population 
of cells was exposed to a variety of stresses that resulted in the alteration of gene 
expression. Alternatively, the cellular environment may be kept constant and the 
genotype may be altered. Typical stresses that result in an alteration in gene 

10 expression profile will include, but is not limited to conditions altering the growth 
of a cell or strain, exposure to mutagens , antibiotics, UV light, gamma-rays, 
x-rays, phage, macrophages, organic chemicals, inorganic chemicals, 
environmental pollutants, heavy metals, changes in temperature, changes in pH, 
conditions producing oxidative damage, DNA damage, anaerobiosis, depletion or 

15 addition of nutrients, addition of a growth inhibitor, and desiccation. Non-stressed 
cells are used for generation of "control" arrays and stressed cells are used to 
generate an "experimental", "stressed" or "induced" arrays. 

In an alternate embodiment the present invention provides a method for 
quantitating the amount of each protein specifying RNA contained within an 

20 organism. This is often necessary in gene expression profile analysis because the 
quantity of transcript produced as well as its fold elevation is needed for 
quantitative analysis of the cell's physiological state. The method is applicable to 
both prokaryotic and eukaryotic organisms including for example, cyanobacteria 
(such as Rhodobacter and Synechocystis) yeasts (such as Saccharomyces, 

25 Zygosaccharomyces, Kluyveromyces, Candida, Hansenula, Debaryomyces, 
Mucor, Pichia and Torulopsis), filamentous fungi (such as Aspergillus and 
Arthrobotrys\ plant cells and animal cells. The method proceeds by generating a 
comprehensive micro-array as described above, from either total or 
poly-adenylated RNA, depending on the whether the organism is prokaryotic or 

30 eukaryotic. Following the generation of the array, a set of labeled DNA and a set 
of labeled cDNA are synthesized having complementarity to the ORF's of the 
array. The signals generated from the independent hybridization of either the 
labeled DNA or cDNA are used to quantitate the amount of protein specifying 
RNA contained within a genome. 

35 • In another embodiment the invention provides a method for gene 

expression profiling with a reduced signal to noise ratio. This is accomplished 
using a dual "label swapping" method and is again applicable to both prokaryotic 
and eukaryotic genomes. "Label swapping" refers to a system where a set of 
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probes or cDNA generated from control or experimental conditions are labeled 
with two different labels and mixed prior hybridization with the array. Two sets 
of control and experimental probes or cDNA's are generated. One of the control 
sets is labeled with a first label (i.e. cy3) and the other is labeled with a different 
5 second label (i.e. cy5). The two differently labeled sets are mixed and then 
hybridized with the array. The same process is repeated for the experimental 
conditions and the resulting control and experimental fluorescent signals are 
compared. This combination of signals provides (a) additional measure of each 
transcript level and (b) allows for the canceling of any bias associate with 

10 differential incorporation of fluorescently labeled nucleotide into cDNA or the 
hybridization of that cDNA. 

The preferred embodiments of the invention are discussed below. 
Bulk E. colt RNA was reverse transcribed to prepare hybridization probes. 
Despite the large amount of stable RNA (ribosomal and transfer RNAs) in the 

15 template, hybridization to protein-encoding genes was readily detected. 

As shown in Figure 1 with IPTG induction, conditions have been 
optimized to yield highly reliable data. In Figure 1, basal expression levels were 
plotted on the ordinate, induced levels on the abscissa. Panel A illustrates the 
results obtained when two Cy3-labeled probes were hybridized to duplicate whole 

20 genome array sets. Panel B represents an experiment in which the Cy5-labeled 
cDNA copy of control RNA and the Cy3-labeled copy of induced RNA were 
co-annealed to a single slide set. The RNAs used to generate the results in 
Panel B were each labeled with the other dye to allow a "reciprocal" 
hybridization. In Panel C, the resulting data were averaged with the data 

25 presented in Panel B to yield the scatter plot depicted in Panel C. A second 
independent set of RNA samples were isolated, their cDNAs labeled with both 
dyes and products hybridized in both possible combinations to generate the results 
depicted in Panel D. Panel E displays the averaged results of the two independent 
experiments depicted in Panels C and D. 

30 Reciprocal Labeling . When the results of a single hybridization experiment using 
different slide sets as capture reagents for Cy3-labeled cDNA derived from treated 
and control cells were plotted in log-log form, lacZYA induction above the 
background was detected (Figure 1A); variation of other genes was also 
significant as indicated by the width of the points falling along the diagonal of this 

35 scatter plot. Improvements were observed by labeling the control sample with 

Cy5 and the induced sample with Cy3 before hybridizing to a single set of 3 slides 
(Figure IB). However, there was a skewing of the data away from the abscissa 
and towards the ordinate (y-axis; Cy5-labeled probe). Averaging of these results 
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with others obtained using reciprocal copying of the same RNA samples (induced 
RNA reverse transcribed with Cy5 and control RNA with Cy3) resulted in a 
decreased variation between the treated and control samples (Figure 1C). Such 
"label swapping" lessened the skewing and decreased the scatter. The experiment, 
5 depicted in Figure 1C, was replicated; fresh cultures were induced and nucleic 
acids processed to yield the data depicted in Figure ID. The experiments shown 
in Figures 1C and ID each represent four measurements of individual transcript 
abundance; this repetition and averaging yielded the tight constellation shown in 
Figure IE which combined the data of Figure 1C and ID. Nonetheless, the scatter 

10 plot resulting from an experiment using the optimized protocol (Figure 1 E) 

illustrated that measurements of gene expression were still subject to considerable 
variation when the signal was in the lowest part of the detectable range. 

The effect of 1 mM IPTG upon expression of the arrayed genes was 
investigated. Duplicate RNA preparations of the control and IPTG treated cells 

15 were each labeled with Cy3 and Cy5 by first strand cDNA synthesis. Averaging 
of measurements gave an optimal reliability of the data (Figure 1). Examination 
of the extent of hybridization to any individual gene revealed a wide dynamic 
range with more than a thousand fold variation in signal intensity between genes 
(see Figure 1). The expression of only 8 genes increased by a factor of more than 

20 2 after exposure to 1 mM IPTG for 1 5 min (Figure 1 E). These induced genes are 
listed in Table 1 . Two-fold or greater repression was not observed after this 
treatment. The most highly induced RNAs corresponded to the lac operon 
structural genes. Examples of the induced genes are b0956 t melA, vxaA and 
M783. 

25 Signal Quantitation . The present invention was applied to monitor the effects of 
growth stage and medium on gene expression. For these embodiments, signal 
quantitation was important The percentage of RNA that programs protein 
synthesis has been determined under a wide variety of growth regimes (Bremer 
and Dennis, Escherichia colt and Salmonella: Cellular and Molecular Biology 

30 ASM Press: 922-937 ( 1 996)). The fraction of those protein-specifying transcripts 
devoted to each arrayed gene was estimated. Hybridization signals arising from 
annealing of RNA-derived Cy3-labeled cDNA populations were quantitated by 
dividing by the signal generated using Cy3 fluorescent DNA arising from copying 
of sheared E. coli genomic DNA as a probe. The probe synthesized by copying 

35 genomic DNA was used to approximate equimolar transcription of the entire 
genome. This quantitation allowed calculation of mRNA inventories. Three 
RNA samples were measured. The samples were isolated from cells growing 
exponentially in rich medium, from cells growing exponentially in minimal 
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medium, and from cells in minimal medium transitioning from exponential to 
stationary phase. RNAs from certain central metabolic (gapA, ptsH)> defense 
(ahpQ cspQ 9 DNA metabolic (hns), surface structure (acpP, ompACFT, lpp) 9 
translation (rplBCKLMPWX, rpmBCI, rpsACDHJNS, trmD,jusA t infC, tufAE), 
5 transcription (rpoAB), and unassigned (B4243) genes (Riley and Labedan, 

Escherichia coli and Salmonella: Cellular and Molecular Biology, ASM press: 
2118-2202 (1996)) were abundant (>0.1%, among the top 100 transcripts) in all 
three samples. 

The most highly transcribed genes in actively growing broth-cultured cells 

10 often encoded proteins involved in translation. In contrast, cultures at a similar 
growth stage in glucose minimal medium, expressed to a very high level several 
small molecule biosynthetic genes and the means to utilize glucose. Thus, an 
agreement between these molecular analyses and the accumulated understanding 
of E. coli physiology was observed (Escherichia coli and Salmonella: Cellular 

15 and Molecular Biology ASM press). This agreement was underscored in the 
analysis of cells transitioning from the exponential growth phase; the elevated 
expression of several rpoS-controlled genes corresponded to expectations 
(Escherichia coli and Salmonella: Cellular and Molecular Biology, ASM press). 
The genes, each representing between 0.0007% and 1% of the hybridizing 

20 signal, were expressed in LB grown cells. The distribution of genes as a function 
of expression level is plotted in Figure 2. Figure 3 depicts fractional expression as 
a function of summed genes with genes ranked by expression level. In Figure 2, 
the histogram plots the number of genes as a function of expression range. 
Diagonally striped, solid, and horizontally striped bars reflect distributions 

25 observed in RNAs derived from cells growing exponentially in minimal medium, 
cells transitioning to stationary phase in minimal medium, and cells growing 
exponentially in rich medium, respectively. In Figure 3, the fraction (summed 
open reading frame transcripts/total open reading frame transcripts) was plotted as 
a function of genes summed. The order in which genes were summed was based 

30 upon expression level with the most highly expressed gene summed first. 

Fewer genes were expressed in LB than in minimal medium (Figure 2); the 
fraction of rare transcripts appeared under-represented in LB medium (Figure 3). 
The fifty most highly expressed genes in broth-grown cells are listed in left-most 
columns of Table 2; twenty-six of these intensely transcribed genes encode 

35 proteins involved in translation while three encode chaperones. 

The broad distribution analyses (Figures 2 and 3) readily revealed the 
significant differences observed in expression of E. coli when grown in defined 
and rich media. In minimal media many more genes were transcribed over a 
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somewhat broader range. The 50 genes most highly expressed in exponentially 
growing cells cultured in minimal medium with glucose as a carbon/energy source 
are listed in the middle columns of Table 2. Eight biosynthetic genes were highly 
expressed (Table 2). Notable among them were metE, encoding the aerobic 
5 methionine synthase, and ilvC, an isoleucine-valine biosynthetic gene subject to 
feed-forward transcriptional activation (Umbarger, H.E, Escherichia coli and 
Salmonella: Cellular and Molecular Biology, ASM Press (1996) ) by its 
substrates. Both the //vC-encoded enzyme (Petersen et aL, Nucleic Acids Res. 
14:9631-9651 (1986)) and me/2s-encoded enzyme (Green, R. C, Escherichia coli 

10 and Salmonella: Cellular and Molecular Biology, ASM Press (1 996)) are sluggish 
catalysts. The metE product accounts for about 5% of E. coli protein when cells 
are cultured in minimal medium with glucose as a carbon/energy source 
(VanBogelen et aL, Escherichia coli and Salmonella: Cellular and Molecular 
Biology, ASM Press (1996)). Other highly expressed biosynthetic genes included 

15 folE and cysK; the folE product, GTP cyclohydrolase I catalyzes both cleavage of 
the 5-membered ring of guanine and the rearrangement of the ribose moiety of the 
substrate, GTP (Green et aL, Escherichia coli and Salmonella: Cellular and 
Molecular Biology, ASM Press (1996)). cysK r encoding o-acetylserine(thiol)- 
lyase isozyme A, is responsible for more than 90% of sulfur fixation under aerobic 

20 conditions (Kredich, N. M., Molecular Biology, ASM press (1996)). Transcripts 
of the pyrBI opcron encoding aspartate transcarbamylase also were highly 
expressed during exponential growth in minimal medium relative to a broth- 
grown culture. This expression level is a characteristic signature of strain 
MG1655 whose aspartate transcarbamylase content is elevated more than 100 fold 

25 when grown in the absence of uracil due to an rph mutation that is polar on pyrE 
(Jensen, K. F., J. BacterioL 181:3525-3535 (1993)). The other highly expressed 
transcripts, thrL and pheF, encoded, respectively, the threonine leader polypeptide 
(Landick et aL, Escherichia coli and Salmonella: Cellular and Molecular Biology, 
ASM Press (1996)) and the phenylalanine-inhibited first enzyme of the common 

30 aromatic pathway. The pheF product, one of three isozymes, is estimated to 
account for more than 80% of the activity catalyzing the first common step of 
aromatic amino acid synthesis (Pittard, A. J., Escherichia coli and Salmonella: 
Cellular and Molecular Biology, ASM Press (1996)). 

In this embodiment, expression of several genes catalyzing fueling 

35 reactions was also elevated. Unexpectedly, aceAB, encoding the glyoxylate shunt 
enzymes malate synthase and isocitrate lyase (Cronan and Laporte, Escherichia 
coli and Salmonella: Cellular and Molecular Biology, ASM Press (1996)), was 
highly expressed. Perhaps the TCA cycle functions in its branched state during 
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this phase of growth requiring the glyoxylate shunt for anapleurotic replenishment 
(Neidhardt et al., Physiology of the Bacterial Cell: A Molecular Approach, 
Sinauer Associates, Inc. (1990)). As expected, ptsHI transcripts encoding 
phosphotransferase sugar transport common components (Postma et al. , 
5 Escherichia coli and Salmonella: Cellular and Molecular Biology, ASM Press 
(1996)) also accumulated to a very high titer in glucose-minimal medium. 

The present invention was applied to monitor the transcripts of cells 
transitioning from exponential to stationary phase in defined, minimal medium. 
During this transition, significant changes in gene expression were expected and 

10 observed. Expressed gene levels were from 0.0023 to 1.6%. A total of 

1030 genes, of which 1 10 have a defined role, did not appear to be expressed. In ' 
this embodiment, the 50 most highly expressed genes during this transition are 
listed in the rightmost columns of Table 2. Significantly, several r/raS-regulated 
genes (Hengge-Aronis, Escherichia coli and Salmonella: Cellular and Molecular 

15 Biology, ASM press, 1497-1512) including hdeA (1 1 fold), hdeB (8.9 fold), dps 
(4.4 fold), gadA (8.2 fold) and gadB (12 fold) (Castanie-Cornet et al., J. Bacteriol 
181:3525-3535 (1999)) as well as rpoS (2.6 fold) itself became quite highly 
expressed. Despite this remodeling of transcription, the overall patterns of gene 
number as a function of expression level (Figure 2) and fractional expression as a 

20 function of ranked gene (Figure 3) were not as distinct as might have been 
expected in comparison to the patterns observed for RNA extracted from 
exponentially growing cells. 

The observed expression patterns are summarized in Table 3 where gene 
products were grouped by metabolic function using an established classification 

25 scheme (Riley and Labedan, Escherichia coli and Salmonella: Cellular and 

Molecular Biology, ASM Press (1996)). Exponential growth in minimal medium 
elevated the amount of pyrimidine and amino acid biosynthetic transcripts. In 
contrast cofactor and purine transcripts did not appear to accumulate relative to 
growth in broth. Expression of glyoxylate shunt and miscellaneous glucose 

30 transcripts was also elevated in minimal medium; the seven-fold elevation of 

glyoxylate shunt transcripts exceeded the average of that observed for amino acid 
biosynthetic mRNAs. Expression of genes involved in sulfur fixation was also 
elevated during growth in minimal medium. 

The rapid growth observed in LB was reflected in the gene expression 

35 profile, as was the difference in carbon energy/source between glucose and amino 
acids. LB-grown cultures displayed elevated expression of genes specifying 
glucogenic enzymes and of genes whose products degrade small molecules. 
Expression of the ATP and proton motive force generating machinery, elevated by 
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a factor of about 2, paralleled increased ribosomal protein, aminoacayl-tRNA 
synthetase and foldase/usher expression. 

Changes observed upon entering the transitional period between 
exponential and stationary phase growth were less dramatic. Nonetheless, 
5 elevation of mRN As specifying gluconogenic, glycolytic, and TC A cycle 
enzymes was observed as was an increase in transcripts-encoding enzymes 
responsible for metabolic pool interconversions and for the non-dxidative branch 
of the hexose monophosphate shunt. The cells also displayed an increased titer of 
foldase/usher-specifying and global regulatory function transcripts while 

10 transitioning between growth phases. 

The present invention was used to monitor the change in gene expression 
when cells overexpressd sdiA gene. The sdiA is a positive activator of an operon 
that includes ftsQAZ, genes essential for septation. 

RNA isolated from broth grown, exponential phase cultures harboring 

15 either a single copy (pUC19/RFM443) or many copies (pDEW140/RFM443) of 
sdiA were compared after conversion into fluorescently labeled cDNA by 
hybridization to individual genes arrayed on glass slides. 

Expression of about 9% of the & coli genes was elevated in the strain 
containing the multicopy sdiA plasmid (Table 4). Transcripts of seven genes 

20 involved in cell division were raised 2.1 to 1 1 fold by amplification of sdiA as 
were a large number (about 20) of genes involved in DNA replication, repair, and 
degradation. Transcript levels of eight genes whose products alter the 
susceptibility of E. coli to drugs were more highly expressed in the strain 
containing the gene amplification. This genetic configuration also resulted in 

25 elevated expression of several lipopolysaccharide biosynthetic genes (rfa) as well 
as open reading frames encoding membrane structural elements. 

Expression of several genes of unknown function was also elevated in 
response to the presence of multiple copies of sdiA. The genes whose transcripts 
were highly (>6 fold) elevated in response to the multicopy sdiA plasmid 

30 included: b0135 (6.4 fold, annotated as putative fimbrial-like protein gene), 

W225 (6.4 fold, a gene apparently co-transcribed with dinJ since between them 
there is only a 3 base pair intergenic region), b0157 (1 1 fold, encoding a putative 
malate dehydrogenase), b0530 (also known as sfinA and predicted to specify a 
fimbrial like protein was elevated 6.5 fold), b0712 (encoding a putative 

35 carboxylase had a 6.4 fold increase in transcript content) and bl438 (1 1 fold 
elevation in expression). 

Around 3% of the E. coli genes were repressed in a strain harboring the 
sdiA plasmid relative to the control strain containing the vector (Table 5). The 
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genes involved in chemotaxis, mobility, and flagella biosynthesis were repressed 
dramatically. Genes for transport of certain carbohydrate substrates and cations 
(Fe** and K + ), degradation of corresponding carbon compounds, as well as 
acetate fermentation were repressed. The presence of pDEW140, a pUC19 
5 derivative harboring sdiA, resulted in a 30-fold elevation in detectable sdiA 

transcript. Expression of sdiA was very low (0.0015 %, the 4212th most abundant 
transcript) in LB grown £. coli MG1655. The increased expression in the plasmid 
containing strain raised the transcript rank to about 300. 

Genes ddl ftsQ, ftsA, ftsZ and IpxC are organized in the order mentioned 

10 above in the complex ftsZ containing operon, and the above genes are transcribed 
in the same direction starting with ddl Since the j<fc4-encoded positive activator 
drives transcription of a mRNA including ddl, ftsQ, ftsA, ftsZ t and IpxC, increased 
quantities of RNA hybridizing to these genes were expected. Amplification of 
sdiA due to its presence on a multicopy plasmid elevated expression of ddl, ftsQ, 

15 ftsA, ftsZ and IpxC 4.6, 8.8, 10, 1 1 and 3.5 fold, respectively, relative to the strain 
that harbored pUC19 (Table 4). 

In the immediate down stream of sdiA, there are yecF, followed by uvrY 
and uvrC gene, respectively. wvr7 and uvrC genes are transribed in the same 
direction as sdiA and theyecF is transcribed in the opposite direction. 

20 Unexpectedly, amplification of sdiA elevated expression of two genes downstream 
of sdiA was observed. uvrY expression was elevated 12 fold while uvrC 
transcription was increased by a factor of 9 (Table 4). These two genes were 
transcribed in the same direction as sdiA. The expression of yecF decreased only 
slightly. 

25 Amplification of sdiA caused the expression of 101 genes to fall by a 

factor of 2 or more. Among them, 44 were involved in motility and chemotaxis. 
Thirty four genes were down regulated more than five-fold by sdiA amplification. 
Of these, thirty were involved in chemotaxis or motility (cheJV; 
flgB, QDiEMG.HJJ&LMM fliA t QE,F, G,H,J,LM,N t P,S, T f Z; tarandtsr). The 

30 master regulator genes flhC and D controlling flagella operon expression were 
lowered by only 30-38 %. 

The swarming of strains having single or multiple copies of sdiA was 
examined by spotting four single colony isolates of each strain on semi-solid 
medium. Since almost all the genes involved in flagella biosynthesis, chemotaxis 

35 and motility were dramatically repressed in the sdiA overexpression strain, loss of 
mobility of the sdiA overexpression strain was predicted. Experiments were 
carried out to compare the mobility of the two strains. After 8 hr. at 37°C, the 
strain containing pUC19 had swarmed (diameter =32± 2.5 mm) while that 
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containing pDEW140 (sdiA + ) had not (diameter =3.2+0.4 mm). After 23 h the 
pUCl 9 containing strain had filled the petri plate while the strain carrying the 
sdiA amplification had significantly swarmed covering about one half of each 
plate. This partial phenotype could be explained by either (a) plasmid loss 
5 allowing swarming of a revertant (sdiA + haploid) population as ampicillin was 
exhausted from the medium or (b) sdiA amplification only partially compromising 
motility. To distinguish between these possibilities, the site of inoculation and the 
edge of the swarm after 23 hr were streaked for single colonies to an ampicillin 
containing LB agar plate. Massive sdiA+ plasmid loss from cells at the edge of 

10 the swarm was not observed suggesting that the motility defective phenotype was 
not an absolute one. 

If the role of sdiA is to stimulate gene expression required for septation, 
sdiA might coordinate expression of the JfoZ-containing operon with action at the 
origin of replication, oriC. The two genes immediately flanking oriC are mioC 

15 and gidA. mioC is followed by asnC and asnA, and gidA is followed by gidB, atpl 
and atpB. All of the genes except asnA are transcribed in the same direction. 
gidA and mioC were over-transcribed relative to the vector-containing control 
strain. mioC transcript content was elevated 7 fold while those of the gidA and 
gidB genes were elevated 4 and 2 fold, respectively. This effect was most 

20 localized; adjoining genes were not over-expressed. 

Having found enhanced action around oriC, it was reasonable to examine 
the transcript content of genes surrounding the termini of replication when sdiA 
was amplified. There are multiple termini in E. coll The region surrounding terB 
spans minutes 35.3-37.3 (Berlyn et aL, Escherichia coli and Salmonella: Cellular 

25 and Molecular Biology ASM Press: 922-937 (1996)) sdiA amplification-elevated 
expression of 12 of the 88 genes in this region more than 3 fold. Transcripts from 
another 26 genes in the region were elevated by a factor of 1 .5 to 3. Unlike the 
action observed around the terminus, the stimulation seen in the vicinity of terB 
was diffuse. Interestingly, tau, encoding the terminus-utilizing factor, was not 

30 over-expressed. Transcription of gusR, located at 36.5 minutes, was elevated 
8 fold by sdiA amplification (Table 4). 

acr genes specify sensitivity to acriflavines, molecules that intercalate into 
double stranded DNA containing monotonic runs of base pairs. Most acr mutants 
display a defect in acridine efflux; moreover they are often pleiotropic being 

35 hypersensitive to a wide variety of chemicals. Thus hyper-expression of these 
genes in a strain harboring an sdt/4-bearing multicopy plasmid could lead to 
mitomycin C expulsion and the observed resistance to this DNA damaging agent. 
This expectation of acr hyper-expression was confirmed. Evidence for elevated 
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expression of each acr operon was found as indicated by the fold expression 
reported in Table 4. 

Elevated transcription of the gal operon genes at minute 17 was observed 
in the strain bearing the sdiA amplification. These genes, moderately expressed 
5 when strain MG1 655 was grown in LB medium (ranks: galE 84 1 , galT 1512, 
galK 599; Wei and LaRossa, unpublished), were elevated 3.8, 4.9 and 4.1 fold, 
respectively. Nearby, at minute 16 is the ybgUKL-nei region, ybg genes are 
organized as ybg F,ybgJ t ybgK and y&gl, in that order followed by net gene. 
These genes, transcribed in the same orientation, could constitute an operon since 

10 the open reading frames are densely packed, at times overlapping. sdiA 

amplification elevated expression of these genes 5.2, 4.7, 6.4, 3.8 and 8.6 fold, 
respectively, nei encodes an endonuclease responsible for the excision of 
oxidized pyrimidines in the double helix. 

Two linked genes at minute 44, bl956 and 67957 were elevated 6.6 and 

15 14 fold by sdiA amplification. Similarly, expression of b201 7 and b2016 y two 
genes at minute 45 divergently transcribed from and adjacent to the his operon, 
was elevated 3.8 and 3.5 fold, respectively by the presence of the j*fc4-containing 
multicopy plasmid. 

Mitomycin C (MMC) is a DNA damaging agent. E. colt strain, MG1655, 

20 was exposed to MMC, and gene expressions were compared in cells that were 
harvested at 15 and 40 min post exposure. In the cells that were harvested at 
15 min, very little SOS response was detected. At the 40 min, expression 
of40 genes was elevated greater than 2 fold relative to the control strain. Among 
the 40, 13 stress response genes were induced (Table 6) more than 2 fold. The 

25 SOS genes that were induced by a 40 min exposure to MMC were recN, dinl, 
sulA, lexA, recA, uvrA, dinD t priQ umuC, mioQ uvrB, ruvA, andxsed. 

The SOS responsive genes are /ex/l-dependent. In order to determine the 
gene expression patterns in the presence and the absence of the SOS response, 
DM800 and DM803 were exposed to MMC for 40 min and the gene expression 

30 profiles were compared. DM800 and DM803 harbor lexA* and lexA ind alleles, 
respectively. As expected, when exposed to MMC for 40 min, SOS responsive 
genes were induced greater than 2 fold in DM800 strain. SOS responsive genes, 
including lexA, were not induced in the DM803 strain (Tables 7 and 8). Many 
genes that were not induced by MMC in DM800 were induced by the DNA 

35 damaging agent in DM803. For examples, the expression of the following genes 
were induced greater than 2 fold in DM803 but not in DM800 (Tables 7 and 8): 
among the induced genes are those involved with cell division (i.e., dicB t dicC, 
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andsdiA); chemotaxis and mobility (i.e M cheW and motA); and the transport of 
small molecules (i.e., cycA t fadL, chaC, codB and btuC). 

The present invention is not limited to only highly expressed genes for 
several reasons. First, reproducible expression measurements were obtained over 
5 a wide dynamic range (Figure IE). Second, the data of Figure 3 and Table 1 

illustrate that the lac operon expression, although low before IPTG induction, was 
detected suggesting that most transcripts can be readily measured with the 
described techniques. Analyses of well-characterized "promoter-down" mutants 
or spiking experiments may be useful in defining the lower limits of expression 
1 0 that can be observed. 

EXAMPLES 

The present invention is further defined in the following Examples. It 
should be understood that these Examples, while indicating preferred 
embodiments of the invention, are given by way of illustration only. From the 

15 above discussion and these Examples, one skilled in the art can ascertain the 
essential characteristics of this invention, and without departing from the spirit 
and scope thereof, can make various changes and modifications of the invention to 
adapt it to various usages and conditions. 
GENERAL METHODS 

20 Standard recombinant DNA and molecular cloning techniques used in the 

Examples are well known in the art and are described by Sambrook, J., Fritsch, 

E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring 
Harbor Laboratory Press: Cold Spring Harbor, (1989) (Maniatis) and by T. J. 
Silhavy, M. L. Bennan, and L. W. Enquist, Experiments with Gene Fusions, Cold 

25 Spring Harbor Laboratory, Cold Spring Harbor, N. Y. (1 984) and by Ausubel, 

F. M. et al., Current Protocols in Molecular Biology, pub. by Greene Publishing 
Assoc. and Wiley-Interscience (1987). 

The meaning of abbreviations is as follows: "hr" means hour(s), "min" 
means minute(s), "sec" means second(s), "d" means day(s), "mL" means 
30 milliliters), "pL" means microliter(s), "nL" means nanoliter(s), "^g" means 

microgram(s), "ng" means nanogram(s), "mM" means millimole(s), "n M " means 
micrpmole(s). 

Media and Culture Conditions: 

Materials and methods suitable for the maintenance and growth of 
35 bacterial cultures were found in Experiments in Molecular Genetics (Jeffrey H. 
Miller), Cold spring Harbor Laboratory Press (1972), Manual of Methods for 
General Bacteriology (Phillip Gerhardt, R.G.E. Murray, Ralph N. Costilow, 
Eugene W. Nester, Willis A. Wood, Noel R. Krieg and G. Briggs Phillips, eds), 
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pp. 210-213, American Society for Microbiology, Washington, DC. or Thomas D. 

Brock in Biotechnology: A Textbook of Industrial Microbiology^ Second Edition 

(1989) Sinauer Associates, Inc., Sunderland MA. All reagents and materials used 

for the growth and maintenance of bacterial cells were obtained from Aldrich 
5 Chemicals (Milwaukee, WI), DIFCO Laboraoties (Detroit, MI), Gibco/BRL 

(Gaithersburg, MD), or Sigma Chemical Company (St. Louis, MO) unless 

otherwise specified. 

LB medium contains following per liter of medium: Bacto-tryptone (10 g), 

Bacto-yeast extract (5 g), and NaCl (10 g). 
10 Minimal M9 medium contains following per liter of medium: Na 2 HP04 

(6 g), KH 2 P0 4 (3 g), NaCl (0.5 g), and NH 4 C1 (1 g). 

Above media were autoclaved for sterilization then 10 mL of 0.01 M 

CaCl 2 and 1 mL of MgS0 4 . 7H 2 0 plus carbon source and other nutrient were 

added as mentioned in the examples. All additions were pre-sterilized before they 
1 5 were added to the media. 

Molecular Biology Techniques : 

Restriction enzyme digestions, ligations, transformations, and methods for 

agarose gel electrophoresis were performed as described in Sambrook, J., et al., 

Molecular Cloning: A Laboratory Manual. Second Edition, Cold Spring Harbor 
20 Laboratory Press (1989). Polymerase Chain Reactions (PCR) techniques were 

found in White, B., PCR Protocols: Current Methods and Applications. Volume 

15(1993) Humana Press Inc. 

EXAMPLE 1 

Example 1 demonstrates genomic DNA amplification and the preparation 

25 of the high density DNA array. > 

Amplification of 4290 E. coli genes Specific primer pairs (available from Sigma 
Genosys Biotechnolgies, The Woodlands, TX) for each protein-specifying gene of 
£ coli were used in two consecutive PCR amplification reactions. Genomic DNA 
(30 ng) was used as the template in the first round of PCR amplification, and 

30 500-fold diluted PCR products served as templates for PCR re-amplification. 
Duplicate 50 |*L scale reactions were performed. The PCR reactions were 
catalyzed with ExTaq™ polymerase (Panvera, Madison, WI) with the four 
dNTPs (Pharmacia), present at 0.25 mM and the primers at 0.5 \iM. Twenty- 
five cycles of denaturation at 95°C for 30 sec, annealing at 64°C for 30 sec and 

35 polymerization at 72°C for 2 min were conducted. A 2 jiL aliquot of each PCR 
product was sized by electrophoresis through agarose gels. More than 95% of the 
second round PCR products displayed visible bands of the correct size. Second 
round PCR reactions devoid of templates and primers were saved to serve as 
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negative controls for hybridization capture reagents. One third of each second 
round PCR reaction was purified using 96-well PCR purification kits (Qiagen, 
Valencia, CA). The eluted DNAs were dried using a vacuum centrifuge. 
Arraying amplified genes . Twenty microliters of 6M Na2SCN or 50% DMSO 
5 was added to each dried DNA sample (> 0. 1 ng/nL). A generation II DNA spotter 
(Molecular Dynamics, Sunnyvale, CA) was used to array the samples onto coated 
glass slides (Amersham Pharmacia Biotech, Arlington Heights, IL). Aliquots of 
approximately 1 nL from 1536 resuspended PCR products were arrayed in 
duplicate on each slide; a set of three slides supported all amplified E. coli genes. 
10 To serve as controls, 76 specific R coli PCR products, 8 amplified genes of 
Klebsiella pnuemoniae and 12 plant cDNA clones were also spotted onto each 
slide. Spotted glass slides, after baking at 80°C for 2 hr., were stored under 
vacuum in a desiccator at room temperature. 

EXAMPLE 2 

15 Example 2 demonstrates gene expression analysis. R coli mRNA was 

isolated, fluorescent labeled cDNA was prepared using mRNA as a template, and 
the labeled cDNA was hybridized to the high density DNA array. The amount of 

r 

DNA hybridized to DNA array was quantitated and analyzed. 
Microbiological Methods . 

20 E. coli MG1655 was cultured with aeration in either the minimal medium, 

M9 (Miller, J. H., Experiments in Molecular Genetics, Cold Spring Harbor 
(1972)), supplemented with 0.4% glucose or in the rich medium, LB (Miller, J. H., 
Experiments in Molecular Genetics, Cold Spring Harbor (1972)), at 37°C. The 
overnight culture was diluted 250 fold into fresh medium and aerated by shaking 

25 at 37°C. Samples of the minimal medium culture were harvested at Agoo-0.40 
(exponential phase) and 1 .6 (transition to stationary phase) prior to RNA isolation. 
An IPTG induction (Miller, J. H., Experiments in Molecular Genetics, Cold 
Spring Harbor (1972)) was performed to examine the specificity with which it 
effects gene expression. A culture grown overnight in LB at 37°C was diluted 

30 250 fold into fresh LB and aerated at 37°C. When the culture achieved an 

appropriate density (A£oo = 0.40), it was split. To one portion was added IPTG to a 
final concentration of l.mM; the untreated sample served as a control. Incubation 
of both samples was continued with aeration at 37°C for another 15 min 
(A 6 oo=0.45 for both cultures) before RNA isolation was initiated. 

35 RNA Isolation . An equivalent volume of shaved ice was added to 50 mL samples 
which were pelleted immediately in a refrigerated centrifuge by spinning at 
10,410 x g for 2 min. Each resultant pellet was resuspended in a mixture 
containing 100 \xL of Tris HC1 (10 mM, pH 8.0) and 350 |iL of P-mercaptoethanol 
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supplemented RLT buffer [Qiagen RNeasy Mini Kit, Valencia, CA]. The cell 
suspension was added to a chilled 2 mL screwed-capped microfuge tube 
containing 100 |iL of 0.1 mm zirconia/silica beads (Biospec Product Inc., 
Bartlesville, OK). The cells were broken by agitation at room temperature for 
5 25 sec with a Mini-Beadbeater (TM) (Biospec Products Inc., Bartlesville, OK). 
Debris was pelleted by centrifugation for 3 min at 16,000 x g and 4°C; the 
resultant supernatant was mixed with 250 \iL of ethanol. This mixture was loaded 
onto a column from the Qiagen RNeasy Mini Kit. RNA isolation was completed 
using the protocol supplied with this kit Incubation for 1 hr. at 37°C in 40 mM 

10 Tris pH 8.0, 10 mM NaCl, 6 mM MgCl 2 with RNase free RQ1 DNase (1 unit/^iL, 
Promega, Madison, WI) digested any genomic DNA contaminating the RNA 
preparation. The digestion products were purified by a second passage through 
the RNeasy protocol (Qiagen, Valencia, CA). The product was eluted from the 
column in 50 jiL RNAse-free water prior to determining sample concentration by 

15 an A 2 60 reading. RNA preparations were stored frozen at either -20 or -80°C. 

Synthesis of fluorescent cDNA from total RNA . Six microgram of RNA template 
and 12 |ag of random hexamer primers (Operon Technologies, Inc., Alameda, CA) 
were diluted with double distilled (dd) water to a volume of 22 |iL. Annealing 
was accomplished by incubation at 70°C for 10 min followed by 10 min at room 

20 temperature. In order were added: 8 jiL of 5x Superscript II reaction buffer (Life 
Technologies, Inc., Gaithersberg, MD), 4 [iL of 0.1M DTT, 2 of the dNTP mix 
(2 mM dATP, 2 mM dGTP, 2 mM TTP, 1 mM dCTP), 2 nL of 0.5 mM Cy3- or 
Cy5-dCTP (Amersham Pharmacia Biotech, Arlington Heights, IL), and 2 |iL of 
Superscript II reverse transcriptase (200 units/mL, Life Technologies Inc., 

25 Gaithersberg, MD). DNA synthesis proceeded at 42°C for 2.5 hr. before the 

reaction was terminated by heating at 94°C for 5 min. Alkaline hydrolysis of the 
RNA templates was achieved by adding 2 jjL of 5M NaOH followed by 
incubation at 37°C for 10 min. Hydrolysis was terminated by the sequential 
addition of 3 nL of 5M HC1 and 5 ^iL of 1M Tris-HCl, pH 6.8. The labeled 

30 cDNA was purified with a PCR purification kit (Qiagen, Valencia, CA), dried in a 
speed vacuum and stored at -20°C. Labeling efficiency was monitored using 
either A550, for Cy3 incorporation, or A550, for Cy5 labeling, to A 2 6o ratios. 
Fluorescent labeling of genomic DNA. Genomic DNA, isolated from strain 
MG1655 (Bachmann, B., Escherichia coli and Samonella: Cellular and 

35 Molecular Biology, ASM Press (1996)) by standard procedures (Van Dyk and 

Rosson, Methods in Molecular Biology: Bioluminescence Methods and Protocols, 
^ Humana Press Inc. (1 998)), was nebulized to approximately 2 kb pair fragments. 
Three microgram of this DNA was mixed with 6 \ig of random hexamers primers 
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(Operon Technologies, Inc., Alameda, CA) in 33 \xL of dd water. DNA was 
denatured by heating at 94°C prior to annealing on ice for 10 min. Fluorescent 
copying of the genomic DNA was accomplished using the Klenow fragment of 
DNA polymerase I (5 [ig/ |iL, Promega, Madison, WI). To the DNA mixture was 
5 added 6 of lOx Klenow buffer (supplied with the enzyme), 3 of the dNTP 
mix described above, 12 dd H 2 0, 3 jiL of 0.5 mM Cy3-dCTP (Amersham 
Pharmacia Biotech, Arlington Heights, IL), and 3 \xL of the Klenow fragment of 
DNA polymerase I. After a static, 2.5 h incubation at room temperature, the 
labeled DNA probe was purified using a PCR purification kit (Qiagen, Valencia, 

10 CA) before drying in a speed vacuum. 

Hybridization and washing . Spotted slides were placed in isopropanol for 10 min, 
boiled in dd H 2 0 for 5 min and dried by passage of ultra-clean N 2 gas prior to pre- 
hybridization. The prehybridization solution (PHS) was 3.5xSSC (BRL, Life 
Technologies Inc., Gaithersberg, MD), 0.2% SDS (BRL, Life Technologies Inc., 

15 Gaithersberg, MD), 1% bovine serum albumin (BSA, Fraction V, Sigma, St. 

Louis, MO). The hybridization solution (HS) contained 4 jaL of dd water, 7.5 |iL 
of 20xSSC, 2.5 nL of 1% SDS (BRL, Life Technologies Inc., Gaithersberg, MD), 
1 of 10 mg/ml Salmon sperm DNA (Sigma, St. Louis, MO) and 15 |iL of 
formamide (Sigma, St. Louis, MO). The slides were incubated at 60°C for 20 min 

20 in PHS. The slides were next rinsed 5 times in dd water at room temperature and 
twice in isopropanol before drying by the passage of nitrogen. The dried probe 
was resuspended in the HS and denatured by heating at 94°C for 5 min. 
Thirty microliter of the probe-containing HS was applied to a dried, 
pre-hybridized slide, covered with a cover slip (Corning, Corning, NY), and put 

25 into a sealed hybridization chamber containing a small reservoir of water to 
maintain moisture. Hybridization occurred for approximately 14 h at 35°C. 
Cover slips were removed in washing buffer I (WB I = 2xSSC, 0.1% SDS) 
warmed to 35°C prior to incubation for 5 min. Next, the slides were washed 
sequentially for 5 min in lxSSC, 0.1% SDS and O.lxSSC, 0.1% SDS. Slides were 

30 then passed through three baths, each passage lasting 2 min, in 0. 1 xSSC. The 
slides were dried with a nitrogen gas flow. 

Data Collection and Analysis . Hybridization to each slide was quantified with a 
confocal laser microscope (Molecular Dynamics, Sunnyvale, CA) whose 
photomultiplier tube was set to 700 volts and 800 volts for obtaining Cy3 and Cy5 
35 signals respectively. The images were analyzed with Array Vision 4.0 software 
(Imaging Research, Inc., Ontario, Canada). The fluorescent intensity associated 
with each spotted gene was reduced by subtracting the fluorescence of an 
adjoining, non-spotted region of the slide. These readings were exported to a 
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spreadsheet for further manipulation. The four "no DNA" spots derived from 
PCR reactions devoid of template were controls used to determine the noise 
(background signal) level. 

The 96 genes present on each slide were used as internal controls to 
5 quantify signal intensities yielding equivalent readings among the three slides of a 
whole genome array set. This corrected for slide-to-slide signal variation. 

For the IPTG induction experiment, it was presumed that the overall 
transcriptional pattern did not change significantly. Thus the summed equivalent 
reading for the entire genome was quantified; analogous quantitation of the 

10 underlying equivalent readings allowed calculation of fold induction of each 
gene's expression by comparison of such quantified equivalent readings. 
RNA abundance . To convert normalized equivalent readings into measures of 
transcript abundance, a further correction was needed. That correction required 
the hybridization signal arising from an equimolar concentration of all transcripts. 

15 The surrogate for this transcript pool was the fluorescent copy of genomic DNA. 
Thus, the fluorescent intensities from hybridization with RNA-derived probes 
were corrected using fluorescent intensities arising from genomic DNA derived 
probes. Specifically, the abundance of each gene's transcription product(s) was 
determined by dividing the normalized equivalent reading of the genomic DNA 

20 derived sample into the normalized equivalent reading from the RNA derived 
sample. The convention of Riley (Riley and Labedan Escherichia coli and 
Salmonella: Cellular and Molecular Biology ASM Press, 1996)) was followed in 
grouping genes into functional sets. 

EXAMPLE 3 

25 Example 3 demonstrates gene expression profile changes when cell were 

exposed to IPTG, or grown in different culture media. The results are illustrated 

in Tables 1,2 and 3 (Listing of Tables) as described above. 

IPTG Induction An E. coli strain MG1655 was grown overnight in LB at 37°C. 

The culture was diluted 250 fold into fresh LB and aerated at 37°C. When the 
30 culture achieved an appropriate density (A600 =0 - 40 )> lt was s P lil int0 1™ 0 

portions. 

To one portion, IPTG was added to a final concentration of 1 mM. The 
other portion was untreated and served as a control. 

Both samples was incubated with aeration at 37°C for another 15 min 
35 (A600 == 0-45 for both cultures) before RNA isolation. Gene expression analysis 
was performed as described in Examples 1 and 2. 

Cells were grown in different culture media E. coli MG1655 was cultured 
with aeration overnight in either the minimal medium, M9, supplemented with 
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0.4% glucose or in the rich medium, LB at 37°C. The overnight culture was 
diluted 250 fold into fresh medium and aerated by shaking at 37°C. Samples of 
the minimal medium culture were harvested at A 6 o<r 0 - 40 (exponential phase) and 
^00=1 .6 (transition to stationary phase) prior to RNA isolation. The LB culture 
5 was harvested at A 600 =0.4 prior to RNA isolation. Gene expression analysis was 
performed as described in Examples 1 and 2. 

EXAMPLE 4 

Example 4 demonstrates gene expression changes and the effect on 
mobility when sdiA gene was overexpressed in E. coli. The results are tabulated 
10 in Tables 4 and 5 (Listing of Tables) as described above. 

The following plasmids and strains were used in this example. 



strain or plasmid genotype 

MG1655 rph-1 

RFM443 rpsL galK2 lacA74 

pUC19 Cloning vector 

pDEW140 pUC19 + sdiA (EcoRI) 

Strains and growth conditions 

15 Strains of MG1655 (Bachmann, B., Escherichia coli and Samonella: 

Cellular and Molecular Biology, ASM Press (1996)) and RFM443 (Menzel R., 
Anal Biochem., 181:40-50 (1989)) have been described. 

pDEW140 was constructed as following: Chromosomal DNA isolated 
from E. coli W31 10 was partially digested with restriction enzyme Sau3 Al and 

20 size fractionated on agarose gels. Fractions of two size ranges (average sizes of 
approximately 2.5 and 4.0 Kbp) were ligated to pBR322 (0.1 1 pmol) or pUCl 8 
(0. 1 1 pmol) that had previously been digested with restriction enzyme BamUi and 
treated with calf intestinal alkaline phosphatase. The molar ratio of chromosomal 
DNA to vector in each of the ligation reactions was approximately 0.2: 1 . The 

25 ligation products were used to transform ultracompetent E. coli XL2Blue 
(Stratagene) to ampicillin resistance. Pooled transformants (>10 5 for each 
transformation) were used to isolate plasmid DNA. 

0.3 ng of the pUC18 library was electro-transformed into RFM443. The 
MMC resistant clones were selected on LB agar plates supplemented with 

30 1 00 ng/mL of ampicillin and 6 mg/mL of MMC. Resistant colonies appeared 
after the incubation at 37°C. The colonies underwent single colony purification 
on the same medium. Plasmids derived from single colonies were isolated with 
the Qiagen 96-well turbo plasmid prep kit. These plasmids served as a template 
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for primer-directed DNA sequencing of the insert ends. One of the plasmids, 
Plasmid p[3+4/B10], was shown by sequencing to carry the sdiA and surrounding 
genes. From this plasmid sdiAvtas amplified by PCR using the primers: 
f primer - TGGCA CGCAG GACAG AA (SEQIDNO:!) 
5 d primer = TAACA AATCA GC ATA ACTC A T (SEQIDNO:2) 

The PCR used Ampli-Taq Gold. Conditions were 94°C, 1 1 min followed 
by 32 cycles of 94°C for 45 sec, 45°C for 45 sec, 72°C for 90 sec, the 72°C for 
7 min. 

The PCR product was blunt end ligated into EcoRV digested pT7Blue-3 

10 (Novagen). A clone having the proper sized fragment was obtained after 

transformation into DH5-alpha. From colonies, inserts of the proper size were 
detected by PCR-based analysis. Such colonies served as a source of plasmid 
DNA from which sdiA was liberated by digestion with iscoRI. The fragment was 
sized by electrophoresis through agarose gels and ligated into EcoRI digested 

15 pUC 1 9. The ligation mixture was used to tranform DHSalpha. Plasmid preps of 
the transformants were sequenced. One such plasmid containing sdiA was named 
pDEW140 and transformed into strain RFM443. 

Plasmids pUC19 and pDEW140 were transformed into RFM 443 selecting 
for ampicillin resistance on solidified LB agar medium. 

20 Strains of RFM443 (pUC19) and RFM443 (pDEW140) were grown 

overnight with aeration in LB with 150 ^ig/mL ampicillin (LB with amp). The 
overnight culture was diluted 250 fold into fresh medium (LB with amp) and 
incubated further at 37°C with shaking. Cells were collected at O.D.600=0.45, 
and total RNA was purified using Qiagen RNeasy mini. 

25 Motility experiment . 

A single colony was picked from freshly grown RFM443 (pUC19) or 
RFM443 (pDEW140) cultured on LB agar (1.2%), and the center of a LB with 
amp soft agar (0.3%) plate was stabbed. The soft agar plate containing each 
culture was incubated at 37°C. The diameters of the growth zones of the two 

30 strains were measured and compared. 

EXAMPLES 

Example 5 demonstrates the differences in gene expression profile 
between strains proficient or deficient in their ability to respond to DNA 
damaging agents. An isogenic pair of strains, differing only in lexA, was used to 
35 investigate the cell's range of responses to the DNA damaging agent mitomycin C 
(MMC). the results are tabulated in Tables 6, 7, and 8 (Listing of Tables) as 
described above. 
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Strains E. coli strain, MG1655, was used to determine the gene expression profile 
of E. coli in response to a MMC challenge. Two isogenic E. coli strains (Mount 
et al, J. Bacteriol. 1 12:886-893 (1972)), DM800 (lexA + ), used as control 
displaying a normal response to DNA damage, and DM803 (lexA ind ), a strain 
5 unable to mount the predominant "SOS" response to DNA damage, were 
compared using comprehensive gene expression profiling. 
MMC experiment MG1 655 cells were grown in LB overnight with aeration. The 
overnight cultures were diluted 100 fold in LB to final volume of 500 mL and 
grown at 37°C to exponential phase. 200 mL of culture was treated with MMC to 

10 the final concentration of 250 ng/mL. Another 200 mL of culture were mock 

treated without MMC for comparison. Cells were harvested at 1 5 min and 40 min 
for MG1655 strain. With DM800 and DM803 stains, cells, cultured in an 
identical manner, were harvested after 40 min exposure. RNA was isolated and 
gene expression profile was analyzed as shown in Examples 1 and 2. As seen in 

15 Tables 7 and 8, the lexA allele has a great influence on the response of cells to 
MMC. Table 8 shows that the strain deficient in SOS response still response to 
MMC but in different manner. 

EXAMPLE 6 

Preparation of a Svnechocvstis sp. PCC6803 cDNA Probes 

20 This example describes the construction of Synechocystis sp. PCC6803 

cDNA probes following growth of the cells in either minimal growth media 
(control) or minimal media plus UV-B light treatment. The prepared cDNA 
probes are used to determine gene expression patterns of many genes 
simultaneously on a Synechocystis sp. PCC6803 DNA microarray as described in 

25 Examples 7 and 8 below. 

Hybridization of Microarray Slides and Quantitation of Gene Expression 

Microarray glass slides were treated with isopropanol for 10 min, boiling 
double distilled water for 5 min, then treated with blocking buffer (3.5 x SSC, 
0.2% SDS, 1% BSA ) for 20 min at 60°C, rinsed five times with double distilled 

30 water, then twice with isopropanol, followed by drying under nitrogen. Cy3 
labeled cDNA probes prepared from the total RNA of the UV-B treated 
Synechocystis culture, mixed with an equal amount of Cy5 labeled cDNA probes 
prepared from the total RNA of the untreated Synechocystis culture, were applied 
to the glass slide in a total volume of 30 \iL. The hybridization was repeated 

35 using Cy5 labeled cDNA probes prepared from total RNA of UV-B treated 

Synechocystis culture mixed with an equal amount of Cy3 labeled cDNA probes 
prepared from the total RNA of the untreated culture, and applied to a second 
glass slide in a total volume of 30 \iL. The hybridization reactions on the glass 
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slides were performed for 16 hr at 42°C, in a humidified chamber. Hybridized 
slides were washed in IX SSC (0.15 MNaCI, 0.015 M sodium citrate), 0.1% SDS 
for 5 min at 42°C; 0.1X SSC, 0.1% SDS for 5 min at 42°C; three washes in 0.1X 
SSC for 2 min at room temperature; rinsed with double distilled water and 
5 isopropanol; and dried under nitrogen. The slides were scanned using a Molecular 
Dynamics laser scanner for imaging of Cy3 and Cy5 labeled cDNA probes. The 
images were analyzed using Array Vision Software (Molecular Dynamics, 
Imaging Research) to obtain fluorescence signal intensities of each spot (each 
ORF on the array) to quantitate gene expression. The ratio between the signals in 
10 the two channels (redrgreen) is calculated and the relative intensity of Cy5/Cy3 
probes for each spot represents the relative abundance of specific mRNAs in each 
sample. 

Svnechocvstis Strain and Culture Methods 

Briefly, Synechocystis sp. PCC6803 cells were grown at 30 ^ES^nr 2 light 

15 intensity in a minimal growth media, BG-1 1 (Catalog # C-3061, Sigma Chemical 
Co., St. Louis, MO) at 30°C, with shaking at 100 rpm with 5% C0 2 . 
Fifty milliliters of Synechocystis cells grown to mid logarithmic phase (OD73 0nm 
= 0.8 to 1 .0) were divided into two 25 mL cultures and transferred from the 
Erlenmeyer growth flask to two 100 mL petri dishes. The petri dishes, with the 

20 lids on, were placed on a rotary shaker and shaken at 100 rpm. 
Cell Treatments 

For the control, the petri dishes comprising the Synechocystis cells were \ 
placed on a rotary shaker with the lids on, and shaken at 100 rpm. For the UV-B 
treated group, the petri dishes comprising the Synechocystis cells were placed on a 

25 rotary shaker with the lids on, and shaken at 1 00 rpm. A UV-B lamp (302 nm,) 
was positioned above the petri dishes and the distance between the UV-B light 
source and the petri dishes was adjusted to give the desired level of UV-B light 
intensity. The level of UV-B light intensity was measured at the surface of the 
cell culture using a UV light meter, following the manufacturer's instructions. 

30 UV-B treatment was performed for either 20 min or 120 min. Following UV-B 
irradiation, the cells were immediately cooled on ice and their RNA isolated as 
described below. 

Total RNA Isolation and cDNA Probe Synthesis 

Control-treated Synechocystis cells and UV-B treated Synechocystis cells 
35 were cooled rapidly on ice and centrifuged at 4000 ipm for 5 min. Total RNA 
samples were isolated using Qiagen RNeasy Mini Kit (Qiagen), following the 
manufacturer's protocol. RNase A digestion was performed as described in the 
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protocol, and a second round purification was performed using the RNeasy Mini 
Kit. The purified total RNA was analyzed by agarose gel electrophoresis. 

From each total RNA preparation, both Cy3 and Cy5 florescent dye 
labeled cDNA probes were prepared. To synthesize the Cy3 or Cy5 labeled 
5 cDNA probes, a reverse transcription reaction was performed using 1 0 |ig total 
RNA, 12 ^ig random hexamer (Ambion), 50 ^M of dATP, dGTP, dTTP, 25 
of dCTP, and 15 |iM Cy3-dCTP or 22 ^M Cy5-dCTP (Amersham Pharmacia 
Biotech), DTT, and AMV reverse transcriptase (Gibco BRL). The reaction was 
carried out at 42°C for 2.5 hr. After the labeling reaction, RNA templates were 

10 degraded by alkaline hydrolysis and the cDNA probes were purified using Qiagen 
PCR purification kit. The purified probes were quantitated by measuring the 
absorbance at 260 nm, 550 nm (Cy5 dye incorporation) and 650 nm (Cy3 dye 
incorporation). Prior to hybridization, 100-200 pmol of the purified Cy3 or Cy5 
labeled cDNA probes were dried under vacuum, and re-dissolved in the 

15 hybridization buffer (5x SSC, 50% formamide, 0.1% SDS, and 0.03 mg/mL 
salmon sperm DNA). 

EXAMPLE 7 

Analysis of Synechocystis sv. PCC6803 Gene Expression in Minimal Media 
Using a Synechocystis sp. PCC6803 DNA microarray prepared according 
20 to the methods described above and the cDNA probes prepared as described in 
Example 6, Applicants have identified herein promoters that can be employed for 
engineering high levels of gene expression in Synechocystis sp. PCC6803, other 
Synechocystis species, Synechococcus 9 and like organisms. This Example 
describes the identification of the most highly expressed genes and their 
25 corresponding strong promoters in Synechocystis sp. PCC6803 when grown in 
BG1 1 media containing 5 mM glucose as described above. 

Specifically, a DNA microarray was prepared according to the methods 
described above using DNA isolated from Synechocystis sp. PCC6803 cells 
grown in BG1 1 media containing 5 mM glucose. Minimal media Synechocystis 
30 sp. PCC6803 gene expression was determined by hybridizing this DNA 

microarray as described above with fluorescent cDNA probes synthesized from 
total RNA isolated from Synechocystis sp. PCC6803 cells grown in BG1 1 media 
containing 5mM glucose as described in Example 6. 

Briefly, for each minimal media experiment, two hybridization reactions 
35 were performed as described above. Specifically, the first reaction used equal 
molar (typically 100-200 pmol) of Cy5-labeled cDNA from total RNA of the 
minimal media treated sample, and Cy3-Iabeled cDNA probes synthesized from 
Synechocystis sp. PCC6803 genomic DNA; the second reaction used Cy3-labeled 
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cDNA from total RNA of the minimal media treated sample, and Cy5-labeled 
cDNA probes synthesized from Synechocystis sp. PCC6803 genomic DNA. The 
signal intensities were quantitated as described above. To calculate the ratio of 
fold induction (i.e., minimal media/genomic), the minimal media treated sample 
5 signal intensities were divided by the signal intensities of the genomic sample. As 
there were two sets of data from duplicated spotting within each slide, the total 
number of gene expression measurements for each gene was four. All four 
induction ratios for each gene were analyzed using an Excel program (Microsoft) 
to determine the standard deviation; an indicator of the level of confidence for the 

10 specific data set for each gene. The ratio of signal intensities represents a relative 
transcription level of each gene in the same experiment. Herein, Applicants have 
identified the most highly expressed genes, i.e., those genes that are under the 
control of the strongest promoters, in Synechocystis under this minimal media 
condition (see Table 9). 

15 EXAMPLE 8 

Analysis of Synechocystis sv. PCC6803 Gene Expressio n Following UV-B 

Exposure 

Using a, Synechocystis sp. PCC6803 DNA microarray prepared according 
to the methods described above and the probes prepared as described above in 

20 Example 6, Applicants have identified herein UV-B inducible promoters that can 
be employed for engineering high levels of gene expression in Synechocystis sp. 
PCC6803, other Synechocystis species, Synechococcus, and like organisms. This 
Example describes the identification of the most highly UV-B responsive genes in 
Synechocystis sp. PCC6803 when grown under minimal media conditions and 

25 exposed to 20 minutes of UV-B irradiation at 20 yES^nr 2 intensity. These UV 
inducible promoters can be used to control expression of certain proteins that may be 
toxic to Synechocystis cells. 

Specifically, a DNA microarray was prepared according to the methods 
described above using DNA isolated from Synechocystis sp. PCC6803. For each 

30 UV-B treatment experiment, two hybridization reactions were performed as 
described above. In particular, the first reaction used equal molar (typically 
100-200 pmol) of Cy5-labeled cDNA from total RNA of the UV-B treated 
sample, and Cy3-labeled cDNA from total RNA of the control sample 
{Synechocystis sp. PCC6803 grown in BG1 1 media containing 5 mM glucose); 

35 the second reaction used Cy3-labeled cDNA from total RNA of the UV-B treated 
sample, and CyS-labeled cDNA from total RNA of the control sample. The signal 
intensities were quantitated as described above. To calculate the ratio of fold 
induction (i.e., UV-B/control), the UV-B treated sample signal intensities were 
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divided by the signal intensities of the control sample. As there were two sets of 
data from duplicated spotting within each slide, the total number of gene 
expression measurements for each gene was four. All four induction ratios for 
each gene were analyzed using an Excel program (Microsoft) to determine the 
5 standard deviation; an indicator of the level of confidence for the specific data set 
for each gene. 

Applicants have identified herein the most highly UV-B induced genes in 
Synechocystis following UV-B treatment (see Table 10). Only genes whose 
expression was induced more than 4 folds by UV-B light (20 min at 20 nES _, m- 2 

10 intensity) as compared to the minimal media control are listed in Table 1 0. The 
promoters of these genes can be used to construct UV inducible expression 
vectors in Synechocystis. 

Some of the gene families induced by UV-B light include Dl protein 
(psbA), phycobilisome degradation proteins (nblA, nblB), carotenoid biosynthesis 

15 enzymes (crtD, crtD, crtQ), chaperones (clpB, ctpA, dnaJ, dnaK, htpG, hspl7), 
RNA polymerase sigma factor (rpoD), superoxide dismutase (sodB), high light 
inducible protein (hliA), FtsH protease, which is responsible for the degradation 
of photo-damaged Dl protein (ftsH), and DNA repair enzyme (uvrC). Among the 
group of UV inducible genes, there are several genes of unknown function: 

20 ssr2016, and sllOl 85. Applicants' discovery has lead to the first level of 

functional assignment for these genes. The promoters of these genes can be used 
to construct UV inducible expression vectors in Synechocystis. 

A subgroup of Applicants' identified UV-B induced genes comprise two 
Escherichia coli-likc -35 promoter sequences in the 5' upstream untranslated 

25 regions (UTR), including slrl 604 (ftsH), slr0228 (ftsH), sill 867 (psbA3), slrl 3 1 1 
(psbA2), ssl0452 (nblA), ssl0453 (nblA), ssl2542 (hliA), ssr2016 (unknown 
protein with homologues in green algae and plant), and sll0185 (unknown 
protein). The nucleotide sequence "GTTACA" is present in the 5' untranslated 
regions of psbA2, psbA3, and ssr2016 nucleic acids. The nucleotide sequence 

30 M TTTACA" was also found to be present in the 5' UTR regions of psbA2, psbA3, 
ssr2016, rpoD, and ndhD2 nucleic acids. 
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T.TSTING OF TABLES 



Table 1 







LB 






MM exD. phase 


MM transition 
















phase 








fold IPTG 




ranK*' 










gene 


function 


induction 


fn a 


fn 


rank 


fn 


rank 


lacA 


thiogalactoside 


JO 




3747 


2.46E-07 


4244 


2.09E-05 


3816 




acyltransferase 














3849 


lacZ 


galactosidase 


29 


8.88E-05 


2420 


2.16E-05 


3879 


1.98E-05 


lacY 


galactoside 


14 


6.07E-05 


3125 


6.10E-06 


4202 


1.63E-05 


3975 


b2324 


permease 
peptidase? 


5.3 


2.78E-04 


621 


7.30E-05 


2639 


5.43E-05 


2717 


nxaA 


altronate 


4.0 


2.93E-04 


575 


7.82E-05 


2530 


9.03E-05 


1990 




hydrolase 














136 


b!783/ 




3.6 


3.71E-04 


401 


3.23E-04 


576 


1.04E-03 


yeaG 
melA 


galactosidase 


2.9 


4.05E-05 


3729 


1.36E-05 


4050 


1.65E-05 


3966 


b0956/ 


hydrogenase? 


2.5 


2.63E-04 


678 


1.41E-04 


1573 


1.27E-04 


1529 


ycbG 



















5 a - fraction of particular transcript/summed transcripts hybridizing to all open reading frames on the micro-arrays; 
0 - genes are ranked in order of expression with 1 being the most highly expressed gene 
MM: Minimal media, 
exp phase: exponential growth phase 

10 

Table 2. Highly Expressed Genes under Three Different Culture Conditions 



name 


fraction 3 in LB 


name 


fraction in 


name 


fraction in 








minimal (exp. 




minimal 








phase) 




(transition) 


inf& 


0.0070 


cspA 


0.0054 


hdeA 


0.016 


rplk 


0.0068 


metE 


0.0050 


hdeB 


0.0099 




0.0066 


tufB. 


0.0048 


rmf 


0.0083 


rM 


0.0048 


ompA 


0.0046 


dps 


0.0065 


hemK 


0.0047 


ilvC 


0.0042 


Ipp 


0.0063 


rpml 


0.0046 


rmf 


0.0038 


ompC 


0.0059 


rplW 


0.0044 




0.0038 


icdA 


0.0059 


rpU 


0.0043 


ompT 


0.0037 


metE 


0.0053 


acpP 


0.0042 




0.0036 


gapA 


0.0049 


Ipp 


0.0040 


ahpC 


0.0034 


!Hf£ 


0.0049 


zW 


0.0039 


KPM 


0.0031 


ompA 


0.0044 


fusA 


0.0039 


ptsff 


0.0031 




0.0044 


gatB 


0.0038 


aceB 


0.0030 


uspA 


0.0040 


rpsF 


0.0038 


Ipp 


0.0029 


t»M 


0.0039 


&18 


0.0037 


rpsJ 


0.0028 


ilvC 


0.0037 


ompC 


0.0037 


cirA 


0.0028 


rpsN 


0.0036 


mopB 


0.0035 


gapA 


0.0026 


eno 


0.0036 


atpF 


0.0035 


rpml 


0.00266 


ahpC 


0.0035 


hns 


0.0035 


yjjs 


0.0026 


ompT 


0.0033 


rpmB 


0.0034 


rpmC 


0.0024 


zadA 


0.0033 
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ompA 

tnaA 

rpoA 

frmf?. 

rpk 
gapA 

rj?!M 



rplT 

mix. 

priB 

ompF 

hupA 



gatA 

rpsA 

gatY 

rpsS 

ppa 

gatZ 

cspE 

cspC 

mopA 



0.0033 


fysA 


U.0Uz4 


accB 


U.vv/J J 


0.0033 


b2745 


0.0024 




A t\(\tl 
U.UUjx 


0.0033 


ompF 


0.0023 


A.* A 
JUSA 


A A AIT 


0.0032 


cspC 


0.0022 


ptSti 


A AA7 ^ 


0.0031 


aceA 


0.002 1 


rpsD 


A AAOA 
0.0029 


0.0030 


pvrB 


0.0021 


OS J 12 


A AAOO 


0.0028 




0.0021 


gpmA 


A AAOQ 
U.UU25 


0.0028 


rpsD 


0.002 1 


tnetK 


A AAOfi 


0.0027 


cvsK 


0.0020 


rpmC 


A A AO 7 


0.0027 


ptsl 


0.0020 


JO 

zaaB 


A AMI 

0.0027 


0.0027 


bl452 


0.0019 


rpsV 


A AAOT 

0.0027 


0.0026 




0.0019 


cvsK 


A AAOiC 

U.0020 


0.0025 


fepA 


0.0019 


rpsJ 


a AAOC 


0.0025 


pvrl 


0.0018 


rpsri 


a AAO^ 


0.0025 


aroF 


0.0018 




0.002J 


0.0025 




0.0017 


aceA 


A AAO< 


0.0025 


rpsN 


0.0017 


02266 


A A AO"! 
0.0U23 


0.0024 


b0805 


0.0017 


rplM 


0.0023 


0.0024 


ompC 


0.0017 


rpsS 


0.0023 


0.0024 


KP$d 


0.0017 


nlpD 


A AAOO 


0.0024 


thrL 


0.0017 


acpP 


0.0022 


0.0024 


rpJX 


0.0016 


rpml 


A AAOO 

U.UU22 


0.0024 


rp)i 


0.0016 


ZEQS 


0.0021 


0.0023 


rpsM 


A AA1 £L 


rpoA 


A AA9ft 


0.0023 


w4l"48 


0.0016 


hns 


0.0020 


0.0022 


rpJM 


0.0016 


b4253 


0.0020 


0.0022 


w079S 


0.0016 


rpJM 


0.0020 


0.0021 


folE 


0.0015 


b!452 


0.0019 


0.0021 


icdA 


0.0015 


b0817 


0.0019 


0.0021 




0.0015 


bW03 


0.0019 



reading frames on the micro-arrays 

bold, double underlined -foldase/usher genes; bold, underlined - stress responsive genes; bold - 
central metabolic enzyme-specifying genes; double underlined -biosynthetic genes; dqtted.uj(ujerljned - 
translation-associated genes; underlined -rpoS controlled genes 



Table 3. Summary of three E. coli Expression Profiles 



fraction in fraction in 
MM a /exp. b MM/ 



1 . Cell processes 
Cell division- 26 c 
Chemotaxis. motility 

Chemotaxis and mobility- 12 
Folding and usherine proteins -7 
Transport of large molecules 

Protein, peptide secretion-32 
Transport of small molecules 

Amino acids, amines-49 

Anions-20 



phase 



0.011 

0.0014 
0.0032 

0.0082 

0.0091 
0.0029 



transition 
phase 

0.010 

0.00068 
0.0061 

0.01014 

0.0081 
0.0028 



fraction in 
LB/ exp. 
phase 



0.010 

0.0011 
0.011 

0.010 

0.0068 
0.0023 
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Carbohydrates, organic acids, 

alcohols-82 
Cations-52 

Nucleosides, purines, pyrimidines-6 
Other- 12 

2. Elements of externa! origin: 

Laterally acquired elements 
Colicin-related 
functions-5 

Phage-related functions and prophages-27 
Plasm id-related functions- 1 
Transposon-related functions-34 

3. Global functions 

Energy transfer, A TP-proton motive force -9 
Global regulatory functions- 5 1 

4. Macromolecule metabolism 

Basic proteins 

Basic proteins - synthesis, modiflcation-6 
Macromolecule degradation 

Degradation of DNA-23 

Degradation of RNA-1 1 

Degradation of poIysaccharides-3 

Degradation of proteins, peptides, glyco-61 
Macromolecule synthesis, modification 

DNA - replication, repair, restr./modific , n-89 

Lipoprotein- 11 

Phospholipids- 11 

polysaccharides - (cytoplasmic)-6 

proteins - translation and modification-34 

RNA synthesis, modification, DNA transcript' n-27 
Macromolecules 

Glycoprotein 

Lipopolysaccharide- 1 3 
aa-tRNAs 

Amino acyl tRNA syn; tRNA modific'n-40 

5. Metabolism of small 
molecules 

Amino acid biosynthesis 
Biosynthesis of cofactors. 
carriers 

Central intermediary metabolism 

2'-Deoxyribonucleotide metabolism- 1 2 
Amino sugars- 10 
Entner-Douderoff-3 
Gluconeogenesis-4 
Glyoxylate bypass-5 
Misc. glucose metabolism-3 
Non-oxidative branch, pentose pwy-8 
Nucleotide hydrolysis-2 
Nucleotide interconversions-13 
Phosphorus compounds- 17 
Polyamine biosynthesis-8 
Salvage of nucleosides and nucleotides- 1 8 
Sugar-nucleotide biosynthesis, conversions- 18 

40 



0.020 


0.016 


0.034 


0.012 


0.0098 


0.0076 


0.0010 


0.00090 


0.0017 


0.0021 


0.0027 


0.0012 


0.024 


0.017 


0.023 


0.0055 


0.0042 


0.0065 


0.00017 


0.00055 


0.00086 


0.0058 


0.0035 


0.0038 


0.0077 


0.0054 


0.015 


0.0176 


0.029 


0.018 


0.0047 


0.0048 


0.0074 


0.0038 


0.0030 


0.0031 


0.0029 


0.0015 


0.0022 


0.00056 


0.00033 


0.00040 


0.00842 


0.0093 


0.011 


0.023 


0.019 


0.031 


0.0041 


0.0050 


0.0037 


0.0020 


0.0015 


0.0021 


0.0015 


0.0016 


0.00060 


0.029 


0.030 


0.043 


0.010 


0.010 


0.015 


.0015 


0.0012 


0.0018 


0.013 


0.013 


0.021 


0.012 


0.0093 


0.0033 


0.072 


0.069 


0.064 


0.0034 


0.0032 


0.0032 


0.0012 


0.0011 


0.0015 


0.00040 


0.00034 


0.00060 


0.00086 


0.0012 


0.0021 


0.0076 


0.0075 


0.0012 


0.00085 


0.00050 


0.00039 


0.0026 


0.0043 


0.0043 


0.00010 


0.00011 


0.00027 


0.0041 


0.0039 


0.002 


0.0032 


0.0030 


0.0022 


0.0016 


0.0013 


0.0013 


0.0037 


0.0038 


0.0054 


0.0042 


0.0034 


0.0048 
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Sulfur metabolism- 10 

Pool, multipurpose conversions of intermed. Mef- 

46 

Degradation of small molecules 
Amines-9 
Amino acids- 17 
Carbon compounds-90 

Fatty acids- 10 

Other-8 
Energy metabolism, carbon 
Aerobic respiration-27 
Anaerobic respiration-80 
Electron transport-24 
Fermentation-21 
Glycolysis- 18 

Oxidative branch, pentose pwy-2 

Pyruvate dehydrogenase-6 

TCA cycle- 18 
Fattv acid biosynthesis 

Fatty acid and phosphatidic acid biosynthesis-23 
Nucleotide synthesis 

Purine ribonucleotide 
biosynthesis-22 

Pyrimidine ribonucleotide 
biosynthesis- 10 

6. Miscellaneous 

Not classified - 109 

7. Open reading frames 
Unknown proteins- 1 324 

8. Processes 

Adaptation 

Adaptations, atypical conditions- 16 

Osmotic adaptation- 14 
Protection responses 

Cell killing-3 

Detoxification- 1 1 

Drug/analog sensitivity-32 

9. Structural elements 

Cell envelope 

Inner membrane-4 

Murein sacculus, peptidoglycan-34 

Outer membrane 
constituents- 1 7 

Cell exterior constituents- 1 6 

Surface polysaccharides & antigens 

Surface structures-57 
Ribosome constituents 

Ribosomal and stable RNAs-3 

Ribosomal proteins - synthesis, modiflcationRiboso- 

54 

Ribosomes - maturation and modification-6 

10. ORFsn t listed- 102 



PCT/US00/28352 



a AA7Q 


ft ftiY>* 


ft 00095 


A A1 O 
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0.0040 
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0.010 
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A AATA 
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0.025 
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0.012 
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0.0074 


0.0038 


0.0063 


0.0026 


ft ftftftO 


ft OOOl 1 


0 0001 1 


0.0080 


0.0083 


0.0097 


0.0042 


0.0031 


0.0038 


0.0095 


0.012 


A At ^ 

0.013 


0.023 


0.026 


0.020 


0.0037 


0.0039 


0.0062 


0.0075 


0.0051 


0.0052 


0.079 


0.086 


0.15 



0.0056 0.01 1 0.00066 
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a MM ^Minimal medium, ^exp. = exponential, ^e number following each description is the number 
of genes summed 

Table 4. Gene expression elevated by the presence of a sdiA multi-copy plasmid. 



Genes (grouping by function) 


Fold induction 


Genes (grouping by function) 


Fold induction 


1. Cell processes 




8. Not classified 




Cell division 




„ t 

agal 


3.3 


JlsA 


i f\ 

IU. 


cnpA 


"3 A. 


fisQ 


8.8 


»♦ » 
aim 


2.0 


JtsZ 


1 1 


ainr 




minC 


2.1 


envR 


z.z 


minD 


2.7 


ppaB 


Z.y 


minE 


2.4 


sohA 


A A 

4.9 


sdiA 


30. 


sugE 


2.2 


sulA 


2.6 


uvrY 


1 1.9 


Chemotaxis and motility 




9. Open reading frames of 








unknown functions 




Transport of large molecules 




apaO 


2.0 


Protein, peptide secretion 




naea 


LA 


msyB 


2.1 


relE 


2.1 


oppA 


2.1 


sprT 


3.8 


sapB 


2.2 


DUU65 


3.7 


secD 


2.5 


b0097 


2.1 


secF 


2.4 


b0135 


6.4 


Transport of small molecules 




b0137 


2.5 


Amino acids, amines 




b0138 


2.1 


glnH 


4.0 


b0J41 


4.6 




2.5 


b0163 


2.8 


Carbohydrates, organic acids, ale 




b0189 


2.3 


araE 


5.0 


b0224 


2.1 


frvA 


3.2 


b0225 


6.4 


frwD 


2.1 


b0232 


3.1 


gntV-l 


2.0 


b0233 


2.9 


srlB 


2.1 


b0234 


3.1 


xylF 


3.8 


b0245 


2.3 


Cations 




b0269 


3.2 


bfr 


2.0 


b0281 


2.4 


chaA 


2.0 


b0295 


2.8 


feoA 


5.8 


b0300 


2.0 


fepD 


2.1 


b0303 


4.7 


trkG 


4.1 


b0322 


2.5 


2. Elements of external origin: 




b0404 


3.0 


Transposon-related functions 




b0407 


2.9 


rhsC 


6.3 


b0412 


2.4 
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3. Global regulatory functions 




b0443 


2.2 


Ion 


2.4 


b0461 


2.1 


Irp 


2.2 


b0498 


2.3 


lytB? 


3.0 


b0517 


11.2 


rpoE 


2.4 


b0519 


2.5 


rseA 


2.5 


b0530 


6.5 


rseB 


2.2 


b0534 


2.6 


4. Macromolecule metabolism 




b0567 


2.2 


Degradation of proteins, peptides 




b0625 


3.5 


htrA 


2.3 


b07W 


5.2 


hycl 


2.8 


b0711 


4.7 


ptr 


2.1 


b0712 


6.4 


Degradation of DNA 




b07l3 


3.8 


endA 


2.2 


b07J5 


2.2 


mcrB 


2.1 


b0767 


3.3 


mcrC 


3.5 


bl023 


2.7 


recD 


2.4 


bl024 


2.1 


uvrC 


9.3 


bl069 


3.8 


Macromolecule synthesis, modif! 




blll3 


3.8 


DNA - replication, repair, 




bJ2J4 


2.3 


gidA 


4.1 


bl321 


2.1 


gidB 


2.3 


bl438 


11 


hupB 


4.5 


bl451 


2.0 


mioC 


7.0 


bl454 


2.2 


mutH 


2.2 


b!455 


6.4 


nei 


8.6 


b!458 


2.3 


priC 


2.8 


bl463 


2.1 


recN 


3.6 


bl487 


3.0 


umuC 


2.3 


bl491 


2.3 


uvrA 


2.0 


bl498 


3.7 


xerD 


2.3 


bl499 


2.6 


Lipoprotein 




bl504 


2.1 


blc 


2.8 


bl540 


2.5 


nlpC 


3.2 


bl541 


6.2 


vacJ 


2.1 


bJ542 


3.2 


Phospholipids 




bl543 


2.4 


pgsA 


2.4 


bl544 


3.0 


polysaccharides - (cytoplasmic) 




bl545 


4.4 


glgC 


2.1 


b!547 


2.2 




2.1 


bl551 


2.1 


proteins - translation and modific 




bl560 


3.8 


prfif 


2.2 


bl565 


3.4 


Lipopolysaccharide 




b!567 


3.5 


rfaK 


2.3 


bJ568 


2.0 


rfaL 


2.1 


bl579 


2.6 


rfaY 


2.1 


b!586 


2.2 
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rfaZ 


2.5 


bl60I 


4.0 


5. Metabolism of small molecule 




b!606 


8.0 


Amino acids 




bl607 


7.7 


argA 


3.4 


b!624 


2.1 


aroD 


2.6 


b!627 


2.4 


glnA 


2.5 


bl628 


3.0 


ginD 


2.6 


bl632 


2.1 


lysR 


3.6 


b1648 


2.6 


Biosynthesis of cofactors, earner 




b!649 


2.7 


thiD 


2.1 


b!657 


4.7 


thiM 


2.2 


b!664 


2.1 


gst 


3.1 


b!673 


2.4 


Central intermediary metabolism 




bl688 


2.6 


T-Deoxyribonucleotide metaboli 




bl699 


2.0 


nrdA 


2.1 


bl700 


3.7 


Amino sugars 




b!70J 


2.0 


agaD 


4.5 


bJ706 


2.4 


Gluconeogenesis 




b!707 


30 


ppsA 


2.4 


bJ721 


6.5 


Phosphorus compounds 




b!724 


2.1 


psiF 


3.2 


bJ743 


2.2 


Polyamine biosynthesis 




bl744 


2.5 


speC 


9.5 


bI746 


2.9 


Salvage of nucleosides and nucle 




bJ756 


3.2 


apt 


2.4 


bJ789 


3.6 


gsk 


2.4 


bI847 


2.3 


Pool, multipurpose conversions o 




bl848 


2.7 


galM 


2.3 


bJ870 


2.4 


gcvA 


4.6 


bJ87J 


2.8 


glnK 


2.1 


bJ875 


3.4 


pntA 


10.4 


b!877 


2.4 


pntB 


8.2 


b!935 


2.1 


Degradation of small molecules 




bl953 


3.5 


Amino acids 




bl955 


5.1 


tdcB 


2.0 


b!956 


14 


tdcR 




bl965 


5.2 


Carbon compounds 




bl967 


6.6 


fucA 


2.8 


bl968 


2.4 


fucU 


14 


b2006 


4.4 


ealE 


3.8 


b2007 


2.1 


galK 


4.1 


b2015 


3.6 


galT 


4.9 


b20I6 


3.5 


glcD 


2.0 


b2017 


3.8 


gusR (uidR) 


8.0 


b206J 


2.2 


lacA 


3.7 


b207! 


3.1 


lad 


2.5 


b2J45 


2.7 
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uxuC 




b2190 


2.3 


Fatty acids 




b2229 


2.1 


atoD 


3.8 


b2247 


2.3 


Energy metabolism, carbon 




b2253 


2.3 


Aerobic respiration 




b2267 


2.0 


nuoH 


2.1 


b2268 


2.0 


nuol 


2.1 


b2269 


2.6 


Anaerobic respiration 




b2270 


2.2 


dniR 


5.8 


b230I 


3.8 


hybD 


3.8 


b2302 


7.5 


hybE 


12 


b2339 


2.0 


hybF 


6.6 


b2352 


2.6 


hycA 


3.1 


b2356 


2.3 


hycG 


2.7 


b2385 


2.1 


hycH 


3.8 


b2387 


'"3.6 


hydN 


13 


b2419 


3.2 


hypC 


2.3 


b2420 


3.2 


nrfB 


2.7 


b2439 


2.1 


nrJG 


2.1 


b2443 


2.7 


Electron transport 




b2444 


2.1 


appB 


2.5. 


b2445 


2.7 


cybC 


2.1 


b2485 


2.2 


Pyruvate dehydrogenase 




b2505 


2.1 


pdhR 


4.4 


b2597 


2.8 


TCA cycle 




b2628 


2.2 


fumC 


5.8 


b2629 


3.4 


sucA 


2.1 


b263I 


2.9 


sucB 


2.6 


b2632 


4.2 


sucC 


2.2 


b2640 


5.0 


sucD 


2.7 


b2641 


2.3 


Fatty acid and phosphatide acid 




b2642 


15 


acts 


2.0 


b2643 


2.6 


cdh 


2.3 


b2648 


3.7 


Purine ribonucleotide biosynthes 




b2649 


3.0 


purE 


2.1 


b2756 


2.4 


purR 


2.3 


b2767 


2.9 


Pyrimidine ribonucleotide 




b2833 


2.1 


pyrL 


2.1 


b2845 


2.6 


6. Processes 




b2846 


3.0 


Detoxification 




b2851 


2.6 


cutC 


2.1 


b2862 


2.4 


Drug/analog sensitivity 




b2874 


3.3 


acrA 


6.8 


b2912 


2.6 


acrD 


3.0 


b2931 


2.2 


acrE 


14 


b2984 


2.1 


acrF 


6.3 


b3021 


2.4 
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acrR 
ampC 
arsC 
tolC 

7. Structural elements 

Cell envelope 
Inner membrane 

smpA 

Murein sacculus, peptidoglycan 
ddlB 
hipB 
mreD 

Outer membrane constituents 
sip 

Cell exterior constituents 
kdsA 
IpxC 
rfaB? 

Ribosome constituents 
Ribosomal proteins - synthesis, 
rpsL 



4.5 


b3022 


2.7 


2.6 


b3047 


2.2 


2.1 


b3050 


2.2 


2.6 


b3I30 


2.7 




b3l42 


- 




b3254 


3.8 




b3372 


2.2 


2.6 


b3379 


2.0 




b3395 


4.6 


4.6 


b3397 


3.5 


2.0 


b3398 


2.2 


2.0 


b3441 


4.0 




b3465 


2.1 


2.5 


b3467 


2.5 




b3487 


2.1 


2.3 


b3494 


2.2 


3.4 


b3513 


5.3 


2.6 


b3535 


2.2 




b3536 


2.9 




b3548 


2.0 


3.1 


b3615 


2.0 




b3697 


2.9 




b3711 


2.3 




b3712 


2.1 




b3713 


2.1 




b3714 


2.2 




b3719 


2.5 




b3720 


3.0 




b3776 


2.9 




b3820 


2.5 




b3888 


3.4 




b3937 


2.1 




b3944 


2.0 




b3964 


2.2 




b4038 


2.4 




b4068 


2.5 




b4J4I 


2.3 




b4156 


2.2 




b4\9l 


5.1 




b422l 


5.0 




b4222 


4.8 




b4234 


2.9 




b4248 


2.9 




b4282 


2.1 




b4298 


2.3 




b4300 


2.2 
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b4325 


2.1 


b2088 


9.4 


b4404 


5.9 


b4405 


4.2 


yieD (b369S) 


3.8 


b3914 


3.5 


b3913 


3.3 


yjiT{b4352) 


2.8 


yjiU(b4342) 


2.7 


yjjQV>4364) 


2.4 


yhiL(b3486) 


2.3 


b2848 


2.0 


b3573 


2.0 



Table 5. Gene expression reduced by the presence of a sdiA multi-copy plasmid. 



Genes (grouping by function) Fold repression Genes (grouping by function) Fold repression 

1 . Cell processes 6. Structural elements 

Cell division Outer membrane constituents 



fisX 


2.4 


flu 


7.7 


Taxis and mobility 




Cell exterior constituents 




air (aer) 


4.6 


nanA 


3.7 


cheA 


3.7 


Surface structures 




cheB 


3.3 


fithA 


2.6 


cheR 


2.2 




2.3 


cheW 


5.3 




14 


cheY 


4.6 


flgc 


17 


cheZ 


4.0 


flgD 


17 


motA 


2.9 


flgE 


17 


motB 


2.9 


flgF 


7.1 


tar 


5.3 


flgQ 


13 


tsr 


5.9 


flgH 


5.9 


Transport 




flgl 


5.6 


Protein, peptide secretion 




flgJ 


6.3 


dppA 


2.3 


flgK 


6.3 


Amino acids, amines 




flgl 


11 


sdaC 


3.9 


flgM 


7.1 


Carbohydrates, organic acids 




JlgN 


5.6 


alcohols 








fadL 


2.5 


flhA 


3.1 
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glpF 3.6 

glpT 2.5 

lamB 3.0 

malE 3.7 

mglA 2.7 

rbsA 2.6 

/re5 5.3 

Cations 

fecA 3.5 

/ec£ 2.7 

fecE 3.2 

Jfci 2.6 

kdpA 3.5 

2. Elements of external origin: 
Phage-related functions and prophages 

lor 4.0 

nmpC 2. 1 

3. Global regulatory functions 

cytR 2.2 

4. Degradation of macromolec 
DNA 

xseB 4.8 
Proteins, peptides 

pepE 2.6 

5. Metabolism of small molecu 

Pool, multipurpose conversions of intermed. Met* 

glpK 3.5 

glpQ 2.3 

gltF 2.6 
Degradation of small molecule 
Amino acids 

sdaB 3.2 

tnaA 4.0 

ma£ 2.4 
Carbon compounds 

JiicR 2.8 

lacZ 6.7 
ma/M 3.0 



PCTAJS00/28352 



fliA 


10. 


fliC 


14. 


JliD 


4.6 


fliE 


5.9 


JliF 


11 


fliG 


9.1 


JliH 


5.0 


flil 


3.9 


flU 


7.1 


fliK 


4.8 


flil 


7.7 


jliM 


13 


fliN 


5.0 


fliO 


4.4 


fliP 


6.3 


JliR 


3.1 


JliS 


5.9 


fliT 


5.6 


fliZ 


8.3 



Ribosomes - maturation and modification 
gutM 2.1 

7. Not classified 

fsr 2.1 

8. Open reading frames with unknown functions 



b0105 


2.4 


b0235 


2.1 


b0290 


3.5 


b0307 


2.4 


b0704 


2.7 


b0732 


2.0 


bllOO 


2.5 


bll94 


5.0 


b!200 


2.9 


bJ329 


2.2 


b!339 


2.2 


bl383 


2.4 


b!520 


2.7 


bl566 


4.4 
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malT 


2.6 


bl690 


2.1 


treC 


2.6 


b!722 


2.2 


Energy metabolism, carbon 




b!880 


3.6 


Anaerobic respiration 




bl929 


2.1 


hypD 


2.2 


bl930 


3.1 


Fermentation 




b2001 


2.3 


ackA 


3.1 


b2005 


2.0 


aldA 


3.3 


b20!4 


2.4 


pta 


2.3 


b2537 


2.2 


Fatty acid and phosphatidic acid biosynth 


b2844 


2.7 


accD 


3.0 


b3010 


2.2 


Purine ribonucleotide biosynth 




b31Jl 


2.6 


ndk 


2.2 


b3323 


2.9 






b3442 


2.5 






b3539 


2.4 






b3872 


2.1 






yjiZ(b4354) 


3.3 






yjbP (b3877) 


4.0 






yhjH(b3524) 


4.8 
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Table 6. Gene expression profiles of MG 1655 strain when exposed to MMC 



m15 


control 


final 


Gene name 


M40 


contr I 


total 


G ne 






ratio 








ratio 


name 


25115.41 


4700.39 


5.34 


recN 


98536.33 


11895.47 


8.28 


recN 


34003.66 


4565.91 


7.45 


hs!S 


11287.86 


1677.72 


6.73 


hsIS 


1146.46 


769.58 


1.49 


dinl 


10515.16 


1564.72 


6.72 


dinl 


2516.14 


1468.51 


1.71 


sulA 


13936.20 


2416.81 


5.77 


sulA 


4531.55 


2030.86 


2.23 


W1816 


18614.34 


3231.00 


5.76 


W1816 


34497.54 


4438.78 


7.77 


hsIT 


21578.60 


5358.30 


4.03 


hsIT 


8490.60 


4896.22 


1.73 


lexA 


22768.75 


6375.09 


3.57 


lexA 


992.44 


1322.83 


0.75 


W0522 


8750.63 


2628.39 


3.33 


w0522 


3207.65 


2219.58 


i.45 


W1815 


16206.30 


4874.54 


3.32 


W1815 


1844.00 


1864.81 


0.99 


W2141 


2156.14 


691.51 


3.12 


W2141 


21169.76 


11055.79 


1.91 


recA 


34375.81 


11386.61 


3.02 


recA 


756.88 


510.08 


1.48 


smpA 


1614.58 


558.50 


2.89 


smpA 


17140.20 


23337.13 


0.73 


cspA 


12566.23 


4347.87 


2.89 


cspA 


165.71 


396.95 


0.42 


ecpD 


227.53 


80.37 


2.83 


ecpD 


780.07 


941.22 


0.83 


W3019 


802.62 


288.87 


2.78 


W3019 


821.35 


733.39 


1.12 


W2878 


1151:78 


439.75 


2.62 


W2878 


301.47 


384.64 


0.78 


entD 


491.14 


189.39 


2.59 


entD 


385.41 


778.62 


0.49 


fhuC 


818.05 


321.73 


2.54 


fhuC 


7203.63 


6899.86 


1.04 


W1201 


4623.01 


1874.23 


2.47 


W1201 


1476.06 


1291.17 


1.14 


W2999 


1710.57 


709.13 


2.41 


W2999 


269.15 


393.30 


0.68 


caiB 


326.51 


139.49 


2.34 


caiB 


5327:31 


6525.35 


0.82 


infA 


8186.35 


3504.89 


2.34 


infA 


9150.34 


6624.37 


1.38 


uvrA 


33530.31 


14452.89 


2.32 


uvrA 


1657.22 


1816.09 


0.91 


w2879 


3946.52 


1727.30 


2.28 


W2879 


4322.77 


5547.40 


0.78 


insB_2 


6522.25 


2894.35 


2.25 


insB 2 


2310.26 


1778.19 


1.30 


dinD 


5316.67 


2385.63 


2.23 


dinD 


5349.03 


4945.79 


1.08 


secG 


6754.21 


3076.60 


2.20 


secG 


136.50 


367.57 


0.37 


priC 


341.80 


156.11 


2.19 


priC 


617.58 


603.54 


1.02 


W0561 


11989.75 


5479.59 


2.19 


W0561 


2228.66 


2966.21 


0.75 


exbD 


7165.39 


3289.27 


2.18 


exbD 


1282.58 


893.81 


1.43 


umuC 


5697.94 


2659.80 


2.14 


umuC 


6703.55 


7422.39 


0.90 


mioC 


8113.11 


3804.25 


2.13 


mioC 


3289.24 


4228.67 


0.78 


insB 1 


6065.02 


2854.20 


2.12 


insB 1 


4042.60 


3531.25 


1.14 


trkH 


17795.29 


8430.83 


2.11 


trkH 


867.30 


149475 


0.58 


W1345 


1026.33 


487.58 


2.10 


W1345 


541.83 


848.15 


0.64 


dniR 


1878.65 


899.98 


2.09 


dniR 


5469.65 


4392.09 


1.25 


uvrB 


14508.65 


O9ou.7o 


z.Uo 


uvrB 


1561.63 


2155.11 


0.72 


insA_4 


2298.22 


1111.79 


2.07 


insA_4 


5398.87 


3786.47 


1.43 


ruvA 


10492.52 


5134.38 


2.04 


ruvA 


343.85 


654.22 


0.53 


appY 


815.11 


400.28 


2.04 


appY 


18257.00 


17197.04 


1.06 


xseA 


13206.64 


6494.62 


2.03 


xseA 


1863.91 


1771.47 


1.05 


w0224 


6686.92 


3310.73 


2.02 


W0224 


5595.42 


6241.55 


0.90 


W3139 


11174.79 


5555.32 


2.01 


W3139 


1656.98 


1560.47 


1.06 


W2512 


24511.01 


12207.10 


2.01 


W2512 


349.40 


648.41 


0.54 


W3304 


428.93 


850.84 


0.50 


W3304 


297.42 


326.64 


0.91 


w2228 


271.97 


539.98 


0.50 


W2228 


226.05 


501.68 


0.45 


chaB 


721.77 


1433.44 


0.50 


chaB 


678.56 


880.61 


0.77 


cydA 


699.94 


1392.46 


0.50 


cydA 


1422.74 


2311.13 


0.62 


meIR 


5081.98 


10140.17 


0.50 


meIR 


1051.64 


763.06 


1.38 


W1004 


1265.66 


2528.73 


0.50 


w1004 


386.58 


562.93 


0.69 


hofG 


315.25 


630.05 


0.50 


hofG 


513.59 


611.47 


0.84 


W1429 


360.81 


721.16 


0.50 


W1429 


1256.86 


1688.60 


0.74 


w0299 


9464.02 


18943.08 


0.50 


W0299 
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m15 


control 


final 


Gene name 






ratio 




695 83 


807.90 


0.86 


rpiR 


904.58 


1028.97 


0.88 


celD 


2139.22 


1712.26 


1.25 


w0801 


749 35 


812.05 


0.92 


W0241 


827.51 


522.31 


1.58 


W0621 


3746.49 


3442.41 


1.09 


putP 

r * 


428.60 


231.66 


1.85 


W4099 


197.66 


171.90 


1.15 


prsA 


158.38 


114.75 


1.38 


hybD 


83.33 


438.61 


0.19 


sapB 


334.88 


643.56 


0.52 


W3821 


388.49 


568.82 


0.68 


w1459 


275 47 


170.99 


1.61 


aqaD 


309 42 


402.34 


0.77 


ccmD 


328.75 


360.49 


0.91 


cdsG 


421.02 


702.69 


0.60 


relB 


1361 48 

1 WV/ 1 ."TV 


1629.16 


0.84 


w2809 


924.92 


626.14 


1.48 


W0824 


794.28 


965.82 


0.82 


osmE 


277.27 


66.53 


4.17 


W0362 


612 37 


471.85 


1.30 


w1927 


726.43 


629.35 


1.15 


W0211 


779 65 

1 f w.ww 


724.87 


1.08 


W0237 


853.25 


919.94 


0.93 


W2592 


ft?Q 44 


1006 55 

1 WWW. WW 


0.82 


phnH 


v/w / . \J\J 


1161.05 


0.46 


flgA 


656 68 


723.92 


0.91 


w2595 


800 14 


800.17 


1.00 


w2600 


69.56 


216.88 


0.32 


pheL 


892.86 


431.26 


2,07 


W3049 


764.84 


318.13 


2.40 


w1031 


937 41 


1834.97 


0.51 


w0295 


486.50 


540.31 


0.90 


marB 


587.05 


622.95 


0.94 


W0665 


711 12 


397.24 


1.79 


W1016 


Q03 55 

9Uv< w w 


1024.80 


0.88 


w0298 


570 42 


464.88 


1.23 


W0812 


1112 52 


446.12 


2.49 


w2026 


ana 5Q 


321.83 


2.83 


w0715 


739.31 


960 38 


0.77 


ovrL 


1533 58 

I www. WW 


1350.22 


1.14 


menE 


38.16 


216.63 


0.18 


rnb 


707.80 


1099.36 


0.64 


fucR 


973.74 


681.23 


1.43 


w2818 


603 99 

\J\J\f. 


388 89 

WW w. w w 


1.55 


acpD 


610 87 

V lw.w# 


662.57 


0.92 


w0489 


144 Qfi 


121 61 


1 19 

1 . 1 w^ 


ppdA 


43Q 04 


478 63 

■f i U.ww 


0.92 


w1966 


£.11 .ou 


324 71 

w^*t. 1 1 


0.85 


no temDlate 


1245.39 


1015.95 


1.23 


W2401 


570.79 


872.26 


0.65 


W4094 


100.79 


255.85 


0.39 


dicC 


1035.07 


1108.91 


0.93 


w0286 


4510.92 


4439.04 


1.02 


selB 


1625.34 


1568.78 


1.04 


W2733 
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M40 


c ntrol 


t tal 


G ne 






rati 


name 


748.66 


1502.59 


0.50 


rpiR 


2009.30 


4041.66 


0.50 


celD 


2801.71 


5636.83 


0.50 


w0801 


712.18 


1434.35 


0.50 


W0241 


1293.49 


2605.36 


0.50 


W0621 


12834.08 


25859.69 


0.50 


putP 


377.87 


764.43 


0.49 


W4099 


535.40 


1091.60 


0.49 


prsA 


177.80 


363.82 


0.49 


hybD 


501.32 


1027.11 


0.49 


sapB 


412.40 


845.78 


0.49 


W3821 


417.69 


857.67 


0.49 


W1459 


222.68 


459.00 


0.49 


agaD 


646.05 


1331.95 


0.49 


ccmD 


1070.07 


2212.14 


0.48 


cpsG 


1002.28 


2072.50 


0.48 


relB 


3556.91 


7365.32 


0.48 


w2809 


1242.77 


2589.98 


0.48 


W0824 


2901.61 


6072.91 


0.48 


osmE 


1234.58 


2592.98 


0.48 


w0362 


481.28 


1014.05 


0.47 


W1927 


639.90 


1353.69 


0.47 


W0211 


855.65 


1815.70 


0.47 


W0237 


585.96 


1247.13 


0.47 


W2592 


987.67 


2146.99 


0.46 


phnH 


577.37 


1255.75 


0.46 


flgA 


377.12 


821.43 


0.46 


W2595 


792.47 


1732.89 


0.46 


W2600 


93.28 


205.15 


0.45 


pheL 


1146.62 


2524.70 


0.45 


W3049 


1045.43 


2305.88 


0.45 


W1031 


473.49 


1044.97 


0.45 


W0295 


872.85 


1928.83 


0.45 


marB 


786.69 


1744.63 


0.45 


w0665 


1399.58 


3109.10 


0.45 


W1016 


8569.57 


19066.70 


0.45 


w0298 


1381.42 


3073.94 


0.45 


W0812 


1456.45 


3270.04 


0.45 


W2026 


1050.65 


2361.56 


0.44 


W0715 


689.56 


1552.58 


0.44 


pyrL 


1802.56 


4058.69 


0.44 


menE 


703.05 


1594.10 


0.44 


rnb 


1770.89 


4026.45 


0.44 


fucR 


278.71 


634.22 


0.44 


W2818 


1990.24 


4539.42 


0.44 


acpD 


332.52 


758.67 


0.44 


W0489 


177.26 


405.05 


0.44 


ppdA 


488.90 


1118.27 


0.44 


W1966 


333.36 


764.15 


0.44 


no 








template 


1085.45 


2497.91 


0.43 


W2401 


1505.90 


3477.91 


0.43 


w4094 


543.43 


1266.31 


0.43 


dicC 


844.03 


1986.63 


0.42 


W0286 


341.91 


808.87 


0.42 


selB 


3589.01 


8496.95 


0.42 


W2733 
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187.58 


341.03 


0.55 


no template 


301.58 


716.01 


0.42 


no 
















template 


1841.41 


2180.01 


0.84 


trpC 


10007.85 


23940.58 


0.42 


trpC 


m15 


control 


final 


Gene name 


M40 


control 


total 


Gene 






ratio 








ratio 


name 


278.39 


137.91 


2.02 


relF 


449.81 


1077.54 


0.42 


relF 


791.31 


342.34 


2.31 


w1318 


1106.98 


2655.28 


0.42 


W1318 


224.27 


312.05 


0.72 


agaV 


202.28 


490.11 


0.41 


agaV 


791.41 


366.08 


2.16 


w1002 


1315.80 


3189.26 


0.41 


w1002 


890.33 


701.73 


1.27 


w0685 


1124.73 


2728.61 


0.41 


w0685 


622.65 


68113 


0.91 


potH 


500.10 


1228.03 


0.41 


potH 


993.12 


584.36 


1.70 


w2399 


853.26 


2107.06 


0.40 


W2399 


1275.28 


924.58 


1.38 


metA 


991.35 


2449.36 


0.40 


metA 


146.54 


178.28 


0.82 


lytB 


310.99 


770.46 


0.40 


lytB 


419.51 


730.52 


0.57 


w2987 


638.04 


1593.08 


0.40 


W2987 


827.42 


695.16 


1.19 


W0552 


1225.71 


3084.40 


0.40 


W0552 


568.15 


412.40 


1.38 


w1846 


382.41 


968.10 


0.40 


W1846 


114.29 


119.19 


0.96 


dicB 


533.34 


1351.95 


0.39 


dicB 


842.23 


490.95 


1.72 


W1005 


970.38 


2465.94 


0.39 


W1005 


912.78 


720.90 


1.27 


W2587 


509.45 


1301.74 


0.39 


W2587 


1229.46 


751.44 


1.64 


W1260 


963.45 


2507.31 


0.38 


W1260 


1448.48 


1055.79 


1.37 


w3068 


12241.51 


31901.37 


0.38 


W3068 


1010.15 


949.49 


1.06 


w0551 


2143.72 


5619.44 


0.38 


W0551 


794.65 


573.71 


1.39 


W2599 


616.94 


1633.89 


0.38 


W2599 


90572 


963.26 


0.94 


w0569 


912.64 


2444.12 


0.37 


W0569 


1708.81 


2679.92 


0.64 


fruR 


7146.34 


19274.42 


0.37 


fruR 


1170.58 


1168.59 


1.00 


W3927 


1323.78 


3637.02 


0.36 


W3927 


2894.91 


2291.60 


1.26 


W3069 


15445.74 


42484.53 


0.36 


W3069 


1162.90 


1058.22 


1.10 


W0162 


5614.02 


15712.25 


0.36 


W0162 


494.27 


467.07 


1.06 


W0564 


1311.37 


3713.19 


0.35 


W0564 


2542.41 


5907.71 


0.43 


lar 


367.19 


1045.32 


0.35 


lar 


145.60 


352.49 


0.41 


agaB 


174.76 


500.90 


0.35 


agaB 


360.63 


406.91 


0.89 


W0356 


893.83 


2593.46 


0.34 


W0356 


146.66 


228.04 


0.64 


ptrB 


583.62 


1710.66 


0.34 


ptrB 


89.69 


85.82 


1.05 


tdcA 


140.71 


420.20 


0.33 


tdcA 


1569.84 


1131.81 


1.39 


w0005 


854.54 


2591.31 


0.33 


w0005 


949.03 


723.77 


1.31 


W2820 


711.72 


2231.62 


0.32 


W2820 


382.29 


200.74 


1.90 


racC 


145.56 


473.98 


0.31 


racC 


966.21 


528.97 


1.83 


W1323 


814.12 


2717.50 


0.30 


W1323 


2804.26 


3141.20 


0.89 


tolQ 


7715.84 


26048.34 


0.30 


tolQ 


349.28 


732.94 


0.48 


W0535 


238.50 


834.39 


0.29 


W0535 


19047.83 


10107.02 


1.88 


W2546 


13321.17 


46691.65 


0.29 


W2546 


580.46 


495.64 


1.17 


W0553 


734.83 


2584.37 


0.28 


W0553 


213.09 


433.84 


0.49 


W1426 


136.41 


525.28 


0.26 


W1426 


28409.79 


32349.40 


0.88 


gipT 


25094.68 


97868.64 


0.26 


gipT 


271.65 


813.15 


0.33 


sapC 


354.66 


1417.55 


0.25 


sapC 


1502.52 


1107.42 


1.36 


W2597 


631.01 


2727.88 


0.23 


W2597 


274.59 


176.87 


1.55 


ais 


' 706.69 


3093.84 


0.23 


ais 


191.80 


216.29 


0.89 


celA 


864.54 


3806.84 


0.23 


celA 


109.10 


59.92 


1.82 


ppdB 


38.01 


182.15 


0.21 


ppdB 


249.88 


204.52 


1,22 


agaC 


80.45 


386.08 


0.21 


agaC 


56.62 


13.68 


4.14 


hrpA 


546.07 


2814.78 


0.19 


hrpA 


182.61 


92.19 


1.98 


tdcR 


62.07 


330.94 


0.19 


tdcR 


5374.56 


5767.37 


0.93 


spoil 


10944.76 


60533.88 


0.18 


spoU 


456.55 


279.26 


1.63 


w0549 


956.45 


5470.42 


0.17 


W0549 


195.25 


128.74 


1.52 


agaW 


77.80 


464.51 


0.17 


agaW 


556.82 


343.19 


1.62 


W0548 


783.67 


4816.90 


0.16 


w0548 


177.02 


182.96 


0.97 


alpA 


37.14 


237.77 


0.16 


alpA 
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230.79 63.00 3.66 hybF | 85.60 616.14 0.14 hybF 
Gene names written in bold letters are SOS response genes;M15: 15 min exposure 
to MMC;M40: 40 min exposure to MMC 



Table 7. Gene expressions in DM800 and 



Gene name 


b 


DM800 


control 




number 


MMC 




recN 




98536.3 


11895.5 


hsIS 




11287.9 


1677.7 


dinl 




10515.2 


1564.7 


sulA 




13936.2 


2416.8 


W1816 


b1848 


18614.3 


3231.0 


hsIT 




21578.6 


5358.3 


lexA 




22768.8 


6375.1 


W0522 


b0531 


8750.6 


2628.4 


W1815 


b1847 


16206.3 


4874.5 


W2141 


b2181 


2156.1 


691.5 


recA 




34375.8 


11386.6 


smpA 




1614.6 


558.5 


cspA 




12566.2 


4347.9 


ecpD 




227.5 


80.4 


W3019 


b3080 


802.6 


288.9 


W2878 


b2939 


1151.8 


439.8 


entD 




491.1 


189.4 


fhuC 




818.1 


321.7 


W1201 


b1228 


4623.0 


1874.2 


w2999 


b3059 


1710.6 


709.1 


caiB 




326.5 


139.5 


infA 




8186.4 


3504.9 


uvrA 




33530.3 


14452.9 


w2879 


b2940 


3946.5 


1727.3 


insB_2 




6522.2 


2894.3 


dinD 




5316.7 


2385.6 


secG 




6754.2 


3076.6 


priC 




341.8 


156.1 


W0561 


b0571 


11989.7 


5479.6 


exbD 




7165.4 


3289.3 


umuC 




5697.9 


2659.8 


mioC 




8113.1 


3804.3 


insB 1 




6065.0 


2854.2 


trkH 




17795.3 


8430.8 


W1345 


b1374 


1026.3 


487.6 


dniR 




1878.7 


900.0 


uvrB 




14508.7 


6960.8 


insA_4 




2298.2 


1111.8 


ruvA 




10492.5 


5134.4 


appY 




815.1 


400.3 


xseA 




13206.6 


6494.6 


W0224 


b0231 


6686.9 


3310.7 


W3139 


b3199 


11174.8 


5555.3 


W2512 


b2559 


24511.0 


12207.1 


entF 




179.8 


91.5 


glnK 




408.6 


208.4 


insB 4 




6575.4 


3364.4 



DM803 when exposed to MMC 



ratio 


DM803 


DM803 


ratio 




MMC 


control 




8.3 


2454.6 


2089.9 


1.2 


6.7 


426.0 


370.9 


1.1 


6.7 


780.0 


516.8 


1.5 


5.8 


1167.7 


801.5 


1.5 


5.8 


124.7 


900.3 


0.1 


4.0 


1486.7 


962.6 


1.5 


3.6 


2935.3 


2950.2 


1.0 


3.3 


1299.3 


752.0 


1.7 


3.3 


236.0 


660.6 


0.4 


3.1 


0.0 


934.4 


0.0 


3.0 


8505.3 


6677.0 


1.3 


2.9 


591.3 


624.8 


0.9 


2.9 


52966.2 


42356.6 


1.3 


2.8 


384.4 


287.9 


1.3 


2.8 


970.7 


621.6 


1.6 


2.6 


0.0 


373.7 


0.0 


2.6 


491.6 


328.8 


1.5 


2.5 


967.4 


712.6 


1.4 


2.5 


2598.5 


2567.4 


1.0 


2.4 


1279.4 


1182.6 


1.1 


2.3 


834.2 


806.6 


1.0 


2.3 


2964.1 


3046.5 


1.0 


2.3 


6731.4 


5941.1 


1.1 


2.3 


281.8 


1248.5 


0.2 


2.3 


4566.2 


4307.9 


1.1 


2.2 


3182.2 


2522.4 


1.3 


2.2 


9867.2 


8993.5 


1.1 


2.2 


1151.2 


509.8 


2.3 


2.2 


0.0 


636.5 


0.0 


2.2 


1867.7 


1621.7 


1.2 


2.1 


1014.3 


908.3 


1.1 


2.1 


3218.1 


2444.7 


1.3 


2.1 


4533.2 


4432.3 


1.0 


2.1 


2293.2 


1959.1 


1.2 


2.1 


0.0 


522.9 


0.0 


2.1 


1427.1 


899.9 


1.6 


2.1 


5425.4 


5179.7 


1.0 


2.1 


939.3 


1747.8 


0.5 


2.0 


1819.8 


1918.2 


0.9 


2.0 


593.1 


340.6 


1.7 


2.0 


17733.8 


8677.0 


2.0 


2.0 


2353.4 


1760.8 


1:3 


2.0 


2171.5 


3228.1 


0.7 


2.0 


1537.4 


1511.4 


1.0 


2.0 


494.1 


339.5 


1.5 


2.0 


571.9 


212.8 


2.7 


2.0 


2823.1 


2738.2 


1.0 



53 



WO 01/29261 



PCT/US00/28352 



mpA 




5250.2 


2686.3 


2.0 


1722.3 


2000.1 


0.9 


pheP 




2467.2 


1263.6 


2.0 


2974.0 


2187.8 


1.4 


w0491 


b0500 


508.2 


925.1 


0.5 


176.3 


96.9 


1.8 




b4183 


384.5 


701.1 


0.5 


777.9 


583.0 


1.3 


WUJ/ u 


b0580 


1956.2 


3578.9 


0.5 


867.8 


197.6 


4.4 


\a/0221 

Vrvtfc 1 


t>0228 


1612.4 


2950.5 


0.5 


326.2 


420.6 


0.8 


w1347 


b1376 


5383.3 


9852.6 


0.5 


1194.3 


1736.1 


0.7 


mm 




540.5 


989.7 


0.5 


0.0 


0.0 




xylF 




2117.7 


3885.5 


0.5 


4289.6 


3178.7 


1.3 


w2284 


b2325 


452.5 


830.6 


0.5 


894.8 


911.0 


1.0 


w0627 


b0637 


1984.5 


3647.0 


0.5 


0.0 


299.3 


0.0 


nip 




487.4 


895.9 


0.5 


1004.9 


549.6 


1.8 


w2940 


b3001 


3777.0 


6949.2 


0.5 


2725.9 


1950.7 


1.4 


w0591 


b0601 


1658.1 


3060.6 


0.5 


602.0 


402.6 


1.5 


w0270 


b0278 


1175.1 


2170.8 


0.5 


493.9 


428.3 


1.2 


W2790 


b2849 


455.2 


841.4 


0.5 


489.3 


455.5 


1.1 






1174.7 


2171.6 


0.5 


752.3 


509.1 


1.5 


w1018 

Vr 1 U 1 U 


b1045 


2359.6 


4364.5 


0.5 


525.2 


272.8 


1.9 




b2767 


1312.0 


2440.9 


0.5 


597.4 


383.4 


1.6 


W2329 


b2371 


594,8 


1107.2 


0.5 


861.9 


690.9 


1.2 


atdB 




1218.9 


2269.2 


0.5 


4969.4 


3102.9 


1.6 


w2791 


b2850 


773.2 


1441.3 


0.5 


508.9 


523.1 


1.0 


W2605 


b2659 


6211.7 


11600.3 


0.5 


3614.7 


2885.4 


1.3 


cmtB 




141.2 


263.8 


0.5 


275.0 


164.4 


1.7 


w3890 


b3975 


824.3 


1541.5 


0.5 


607.0 


403.6 


1.5 


nntV 




1242.4 


2324.3 


0.5 


311.4 


446.1 


0.7 


dspA 




92135.1 


173324.7 


0.5 


10420.0 


16655.6 


0.6 


w3210 


b3268 


479.3 


903.6 


0.5 


551.4 


372.6 


1.5 


wfl?71 

W U^l f 1 


b0279 


792.9 


1496.9 


0.5 


471.2 


426.9 


1.1 


W 1 UUJ 


b1029 


2145.7 


4051.8 


0.5 


2.6 


0.0 




w1017 


b1044 


1431.3 


2703.5 


0.5 


1035.5 


413.5 


2.5 


fpoA 




470.5 


888.8 


0.5 


1313.4 


1090.4 


1.2 






1657.3 


3141.1 


0.5 


568.1 


558.4 


1.0 


w0619 


b0629 


2054.3 


3896 8 


0.5 


289.0 


43.4 


6.7 


viiM 
yjjivi 


b4357 


1966.9 


3732.7 


0.5 


2090.3 


1457.0 


1.4 


w2816 


b2876 


1687.7 


3205.6 


0.5 


1046.1 


858.8 


1.2 


w3272 


b3337 


587.3 


1115.5 


0.5 


430.3 


359.9 


1.2 


w3443 


b3507 


446.9 


849.1 


0.5 


591.4 


598.3 


1.0 




b0359 


1435.0 


2731.1 


0.5 


0.0 


72.1 


0.0 






253.0 


482.4 


0.5 


726.3 


421.9 


1.7 


w09fi3 


b0271 


955.9 


1824.5 


0.5 


484.0 


330.6 


1.5 






522 9 


998.6 


0.5 


703.7 


599.6 


1.2 


f] m 7 
in i \t~ 




536 1 


1024.6 


0.5 


424.3 


343.7 


1.2 


\a#9^14 


h935fi 


581 8 


1113.3 


0.5 


1114.7 


731.5 


1.5 


w25Q6 


52649 


370.7 


710.0 


0.5 


1024.4 


605.2 


1.7 


\/iri f 


53823 


542.9 


1039.7 


0.5 


1372.3 


817.5 


1.7 






1003.1 


1922.7 


0.5 


177.0 


90.5 


2.0 


w135Q 


b1388 


2953.5 


5661.9 


0.5 


328.5 


262.2 


1.3 


WW/ \J 


b0690 


1558.5 


2992.7 


0.5 


249.4 


313.2 


0.8 


fixX 




313.0 


601.3 


0.5 


1259.9 


891.9 


1.4 




b4188 


953.2 


1839.0 


0.5 


901.6 


606.7 


1.5 


w0674 


b0691 


1466.3 


2840.2 


0.5 


129.0 


350.8 


0.4 


W2400 


b2447 


1214.0 


2355.8 


0.5 


810.8 


406.3 


2.0 


W1963 


b2004 


537.0 


1043.7 


0.5 


667.7 


598.5 


1.1 


W3974 


b4066 


455.1 


884.6 


0.5 


816.0 


462.9 


1.8 


W1242 


b1271 


3047.5 


5958.6 


0.5 


1870.6 


336.3 


5.6 


W0525 


b0534 


930.8 


1822.0 


0.5 


829.3 


490.0 


1.7 
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W4132 


b4227 


w3304 


b3369 


w2228 


b2269 


chaB 




cydA 




meIR 




W1004 


b1030 


hofG 




W1429 


b1458 


W0299 


b0307 


rpiR 




celD 




w0801 


b0825 


W0241 


b0249 


w0621 


b0631 


DUtP 




w4099 


b4194 


prsA 




hybD 




sapB 




W3821 


b3901 


w1459 


b1488 


agaD 




ccmD 








relB 




W2809 


b2869 


w0824 


b0848 


osmE 




W0362 


b0370 


W1927 


b1963 


w0211 


b0218 


w0237 


b0245 


w2592 


b2645 


nhnH 








W2595 


b2648 


w2600 


b2654 


pheL 




w3049 


b3107 


w1031 


b1058 


w0295 


b0303 


marB 




w0665 


b0682 


W1016 


b1043 


w0298 


b0306 


w0812 


b0836 


W2026 


b2067 


w0715 


b0732 






menE 




mb 




fucR 




W2818 


b2878 


acpD 




W0489 


b0498 


ppdA 




W1966 


b2007 



1602.0 


3151.2 


0.5 


428.9 


850.8 


0.5 


272.0 


540.0 


0.5 


721.8 


1433.4 


0.5 


699.9 


1392.5 


0.5 


5082.0 


10140.2 


0.5 


1265.7 


2528.7 


0.5 


315.2 


630.0 


0.5 


360.8 


721.2 


0.5 


9464.0 


18943.1 


0.5 


7487 


1502.6 


0.5 


2009.3 


4041.7 


0.5 


2801.7 


5636.8 


0.5 


712.2 


1434.3 


0.5 


1293.5 


2605.4 


0.5 


12834.1 


25859.7 


0.5 


377.9 


764.4 


0.5 


535.4 


1091.6 


0.5 


177.8 


363.8 


0.5 


501.3 


1027.1 


0.5 


412.4 


845.8 


0.5 


417.7 


857.7 


0.5 


222.7 


459.0 


0.5 


646.1 


1332.0 


0.5 


1070.1 


2212.1 


0.5 


1002.3 


2072.5 


0.5 


3556.9 


7365.3 


0.5 


1242.8 


2590.0 


0.5 


2901.6 


6072.9 


0.5 


1234.6 


2593.0 


0.5 


481.3 


1014.0 


0.5 


639.9 


1353.7 


0.5 


855.9 


1815.7 


0.5 


586.0 


1247.1 


0.5 


987.7 


2147.0 


0.5 


577.4 


1255.7 


0.5 


377.1 


8214 


0.5 


792.5 


1732.9 


0.5 


93.3 


205.1 


0.5 


1146.6 


2524.7 


0.5 


1045.4 


2305.9 


0.5 


473.5 


1045.0 


0.5 


872.9 


1928.8 


0.5 


786.7 


1744.6 


0.5 


1399.6 


3109.1 


0.5 


8569.6 


19066.7 


0.4 


1381.4 


3073.9 


0.4 


1456.5 


3270.0 


0.4 


1050.7 


2361.6 


0.4 


689.6 


1552.6 


0.4 


1802.6 


4058.7 


0.4 


703.1 


1594.1 


0.4 


1770.9 


4026.5 


0.4 


278.7 


634.2 


0.4 


1990.2 


4539.4 


0.4 


332.5 


758.7 


0.4 


177.3 


405.0 


0.4 


488.9 


1118.3 


0.4 
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3125.1 


2509.8 


1.2 


728.3 


701.9 


1.0 


482.1 


399.3 


1.2 


177.0 


0.0 




963.9 


305.9 


3.2 


1511.6 


1528.2 


1.0 


27.8 


163.0 


0.2 


694.2 


767.7 


0.9 


711.8 


488.3 


1.5 


2048.3 


2846.3 


0.7 


536.3 


419.6 


1.3 


1094.3 


303.1 


3.6 


920.2 


417.5 


2.2 


772.1 


649.4 


1.2 


0.0 


170.6 


0.0 


5693.7 


6221.7 


0.9 


709.5 


539.0 


1.3 


24.5 


0.0 




510.9 


573.9 


0.9 


258.8 


0.0 




945.9 


539.7 


1.8 


994.9 


740.6 


1.3 


966.6 


465.2 


2.1 


92.3 


64.5 


1.4 


0.0 


8.9 


0.0 


0.0 


92.3 


0.0 


1264.0 


1453.8 


0.9 


370.9 


350.1 


1.1 


350.0 


900.3 


0.4 


0.0 


0.0 




824.4 


612.6 


1.3 


198.7 


321.6 


0.6 


355.4 


461.1 


0.8 


1032.5 


566.6 


1.8 


1103.3 


937.7 


1.2 


1482.3 


1396.3 


1.1 


647.4 


431.6 


1.5 


222.5 


309.6 


0.7 


256.4 


258.9 


1.0 


53.4 


180.9 


0.3 


188.9 


101.9 


1.9 


468.2 


461.2 


1.0 


843.0 


130.7 


6.5 


0.0 


592.3 


0.0 


419.6 


203.5 


2.1 


1722.6 


2401.5 


0.7 


356.8 


314.2 


1.1 


22.0 


337.9 


0.1 


154.8 


30.1 


5.1 


520.5 


338.5 


1.5 


4640.7 


928.2 


5.0 


399.4 


0.0 




2264.4 


1572.6 


1.4 


787.6 


511.9 


1.5 


0.0 


287.0 


0.0 


137.3 


183.4 


0.7 


662.6 


505.3 


1.3 


1163.2 


680.8 


1.7 



WO 01/29261 



"no 




tp mn late" 




w2401 


b2448 


w4094 


b4189 


dicC 




WUtOO 








w2733 


b2789 


IIU 




lei up laic 




trpC 




relF 




w1318 


b1347 


aaaV 






b1028 


WUOOO 




nntH 
puin 






K044R 


meir\ 




lyiD 




\A/9Qft7 


h^fi47 
OOU*t f 


WUOOZ 




w i o**o 


h1R7fi 


HirR 




xa/1 nn 1 ^ 

W Iuv5/ 


h1031 




b2640 




h1989 


WOUDO 


h3197 


WUjO 1 


UUwU 1 


ia/9RQQ 




WUOOiJ 


hOR7Q 
v\j%j i o 


fmR 

It Ul\ 








w3069 


b3128 




b0162 

Uv 1 w£ 




bO l 574 


Id! 




oUaD 




Xl/fl^R 

wyooo 




ntrR 




trlr*A 




wuuuo 


UUUVsl 




DZOOU 


raco 




w1323 


M352 






w0535 


b0545 








hOSR3 


w1426 


b1455 


yip i 




sapC 




W2597 


b2650 


ais 




celA 




ppdB 




agaC 




hrpA 





333.4 


764.2 


0.4 


1085.4 


2497.9 


0.4 


1505.9 


3477.9 


0.4 


543.4 


1266.3 


0.4 


844.0 


1986.6 


0.4 


341.9 


808.9 


0.4 


3589.0 


8496.9 


0.4 


301.6 


716.0 


0.4 


10007.8 


23940.6 


0.4 


449.8 


1077.5 


0.4 


1107.0 


2655.3 


0.4 


202.3 


490.1 


0.4 


1315.8 


3189.3 


0.4 


1124.7 


2728.6 


0.4 


500.1 


1228.0 


0.4 


853.3 


2107.1 


0.4 


991.4 


2449.4 


0.4 


311.0 


770.5 


0.4 


638.0 


1593.1 


0.4 


1225.7 


3084.4 


0.4 


382.4 


968.1 


0.4 


533.3 


1351.9 


0.4 


970.4 


2465.9 


0.4 


509.5 


1301.7 


0.4 


963.5 


2507.3 


0.4 


12241.5 


31901.4 


0.4 


2143.7 


5619.4 


0.4 


616.9 


1633.9 


0.4 


912.6 


2444.1 


0.4 


7146.3 


19274.4 


0.4 


1323.8 


3637.0 


0.4 


15445.7 


42484.5 


0.4 


5614.0 


15712.2 


0.4 


1311.4 


3713.2 


0.4 


367.2 


1045.3 


0.4 


174.8 


500.9 


0.3 


893.8 


2593.5 


0.3 


583.6 


1710.7 


0.3 


140.7 


420.2 


0.3 


854.5 


2591.3 


0.3 


711.7 


2231.6 


0.3 


145.6 


474.0 


0.3 


814.1 


2717.5 


0.3 


7715.8 


26048.3 


0.3 


238.5 


834.4 


0.3 


13321.2 


46691.6 


0.3 


734.8 


2584.4 


0.3 


136.4 


525.3 


0.3 


25094.7 


97868.6 


0.3 


354.7 


1417.6 


0.3 


631.0 


2727.9 


0.2 


706.7 


3093.8 


0.2 


864.5 


3806.8 


0.2 


38.0 


182.1 


0.2 


80.5 


386.1 


0.2 


546.1 


2814.8 


0.2 
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470.2 


368.3 


1.3 


1160.0 


715.4 


1.6 


1022.3 


932.0 


1.1 


319.5 


67.2 


4.8 


498.7 


447.4 


1.1 


7781.6 


7090.5 


1.1 


5638.1 


3562.2 


1.6 


408.2 


329.1 


1.2 


421.9 


2562.2 


0.2 


0.0 


0.0 




1.4 


308.5 


0.0 


673.0 


669.2 


1.0 


288.3 


62.7 


4.6 


198.6 


229.5 


0.9 


641.9 


243.8 


2.6 


770.2 


485.4 


1.6 


2026.6 


1816.4 


1.1 


353.9 


181.5 


1.9 


509.8 


303.9 


1.7 


0.0 


588.7 


0.0 


1322.3 


1154.7 


1.1 


260.1 


54.8 


4.7 


0.0 


106.0 


0.0 


557.1 


371.8 


1.5 


0.0 


353.6 


0.0 


2383.9 


3185.5 


0.7 


477.1 


864.7 


0.6 


456.5 


107.8 


4.2 


673.5 


612.3 


1.1 


2076.3 


2369.5 


0.9 


2685.9 


2131.5 


1.3 


499.8 


1153.3 


0.4 


4564.1 


3889.2 


1.2 


584.5 


175.8 


3.3 


82.1 


122.0 


0.7 


371.8 


504.1 


0.7 


10.4 


32.6 


0.3 


246.4 


69.2 


3.6 


612.3 


343.4 


1.8 


442.3 


388.2 


1.1 


904.4 


609.4 


1.5 


167.4 


26.9 


6.2 


297.1 


235.6 


1.3 


1213.4 


1001.4 


1.2 


114.4 


162.5 


0.7 


2567.6 


4164.9 


0.6 


0.0 


277.1 


0.0 


672.5 


358.6 


1.9 


7769.7 


14648.6 


0.5 


546.0 


0.0 




1000.0 


566.4 


1.8 


352.9 


114.6 


3.1 


0.0 


512.6 


0.0 


399.9 


345.8 


1.2 


598.1 


602.8 


1.0 


0.0 


147.3 


0.0 
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tdcR 62.1 330.9 

spoU 10944.8 60533.9 

w0549 b0559 956.4 5470.4 

agaW 77.8 464.5 

W0548 b0558 783.7 4816.9 

alpA 37.1 237.8 

hybF 85.6 616.1 



0.2 


350.0 


232.7 


1.5 


0.2 


4024.5 


3163.9 


1.3 


0.2 


115.5 


287.2 


0.4 


0.2 


479.0 


779.3 


0.6 


0.2 


327.8 


85.0 


3.9 


0.2 


366.3 


269.2 


1.4 


0.1 


332.8 


456.6 


0.7 



Table 8. Gene expressions in DM800 and DM803 when exposed to MMC 



Gene 


bj 


DM800- 


DM800- 


DM800 ratio 


DM803- 


DM803- 


DM803 ratio 


name 




MMC 


control 


(MMC/control) 


MMC 


control (MMC/contro 
h 


chaB 




721.8 


1433.4 


0.5 


177.0 


0.0 




gusC 




1069.6 


1178.1 


0.9 


39.2 


0 0 




prsA 




535.4 


1091.6 


0.5 


24.5 


0 0 




rnb 




703.1 


1594.1 


0.4 


399.4 


0 0 




rspB 




987.2 


1520.0 


0.6 


480.4 


0.0 




sapB 




501.3 


1027.1 


0.5 


258.8 


0.0 




sapC 




354.7 


1417.6 


0.3 


546.0 


0.0 




uxaB 




2051.4 


2423.2 


0.8 


527.4 


0 0 




w0367 


b0375 


1010.4 


1827.8 


0.6 


179.0 


0 0 




W0492 


b0501 


717.2 


769.2 


0.9 


85.3 


ft 0 

V/.V 




W0521 


b0530 


425.3 


746.2 


0.6 


169.9 


n ft 
u.u 




W0537 


b0547 


711.1 


851.5 


0.8 


87.9 


ft ft 
u.u 




W0544 


b0554 


602.7 


558.1 


1.1 


98.8 


ft ft 
u.u 




w1114 


M141 


1523.8 


1839.4 


0.8 


30.1 






W0508 


b0517 


2160.9 


2528.0 


0.9 


480.7 


1 ft 
I .u 


A57 Q 


W0541 


b0551 


542.2 


591.7 


0.9 


140.0 


7 ft 


1 8 A 


moIR 




4939.4 


4515.9 


1.1 


5518.5 


51 ft 


1ft 7 


w2514 


b2561 


7715.4 


7319.4 


1.1 


23955.8 


91R7 ft 

ZOO # -O 


1ft 1 

IV/. 1 


moIR 




3903.5 


3852.3 


1.0 


2355.4 


0A1 ft 


Q 7 


W1329 


b1358 


1909.4 


1872.1 


1.0 


649.0 


75 O 


ft ft 


W1381 


b1410 


2104.8 


1895.7 


1.1 


2603.5 


**ftQ ft 


ft A 


W1088 


b1115 


3071.3 


4218.6 


0.7 


1543.9 


A OA O 


ft A 


dsdX 




2030.3 


2282.9 


0.9 


1833.5 


007 ft 

£.£.1 .U 


ft 1 
O. 1 


thrS 




277.1 


318.4 


0.9 


1498.9 


196.8 


7.6 


W0873 


b0898 


2210.6 


1636.1 


1.4 


1001.4 


132.9 


7.5 


W0617 


b0627 


4834.3 


7145.9 


0.7 


2171.5 


301.1 


7.2 


w3258 


b3323 


4107.7 


3439.8 


1.2 


9485.2 


1318.6 


7.2 


w0999 


b1025 


1233.1 


888.5 


1.4 


199.5 


28.3 


7.1 


rspA 




1862.1 


2002.0 


0.9 


1297.2 


191.4 


6.8 


W0843 


b0867 


2593.1 


2957.8 


0.9 


1333.5 


197.9 


6.7 


W0619 


b0629 


2054.3 


3896.8 


0.5 


289.0 


43.4 


6.7 


marB 




872.9 


1928.8 


0.5 


843.0 


130.7 


6.5 


W1128 


b1155 


865.4 


1384.1 


0.6 


157.3 


24.8 


6.3 


napH 




1548.4 


1771.5 


0.9 


2445.8 


392.2 


6.2 


racC 




145.6 


474.0 


0.3 


167.4 


26.9 


6.2 


W1266 


b1295 


1834.3 


3178.5 


0.6 


1111.8 


178.6 


6.2 


w2857 


b2918 


2013.5 


2025.3 


1.0 


2905.4 


475.0 


6.1 


pfkB 




5266.9 


4949.8 


1.1 


1148.4 


188.9 


6.1 


W1309 


b1338 


3612.6 


4271.9 


0.8 


3943.8 


660.9 


6.0 


napG 




4522.3 


4522.5 


1.0 


5243.5 


900.1 


5.8 


w1242 


b1271 


3047.5 


5958.6 


0.5 


1870.6 


336.3 


5.6 


W0991 


b1017 


2292.7 


1982.0 


1.2 


564.2 


101.9 


5.5 


vsr 




5927.8 


3617.9 


1.6 


3758.3 


687.4 


5.5 
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w0996 


b1022 


954.0 


881.7 


w0957 


b0983 


1903.7 


1772.1 


w1287 


b1316 


4404.7 


4724.2 


w1113 


b1140 


2063.9 


2466.4 


w1168 


b1195 


1068.7 


1662.1 


w0918 


b0943 


964.3 


1122.4 


hsIJ 




1701.0 


1353.6 


fdnH 




2916.7 


3353.8 


w0715 


b0732 


1050.7 


2361.6 


1 1 ICI 1 1 — 




1802.6 


4058.7 


w0511 


b0520 


2664.2 


3109.8 


w2508 


b2555 


3575.2 


2467.8 


pspC 




670.6 


809.3 


w0767 


b0791 


2562.9 


3359.1 


dicC 




543.4 


1266.3 


dicB 




533.3 


1351.9 


chaC 




3102.1 


3008.4 


w0387 


b0395 


2772.4 


3361.1 


w0986 


b1012 


1670.0 


1494.6 


VV I UU4 


b1028 


1315.8 


3189.3 


w0570 


b0580 


1956.2 


3578.9 


w1972 


b2013 


2191.0 


2234.9 


natR 

y airv 




16844.2 


12487.4 


w2582 


b2634 


3769.3 


4821.9 


w2599 


b2653 


616.9 


1633.9 


w0793 


b0817 


9409.7 


8201.4 




b3325 


1313.1 


1350.2 


w0888 


b0913 


3553.0 


2732.1 


w0446 


b0454 


1294.0 


1873.1 


xthA 




7322.0 


7765.8 


w2574 


b2626 


1092.4 


1842.3 


w1211 

W It 1 1 


b1240 


2015.0 


1589.9 






5381 .6 


3197.6 


ncmR 




1731.8 


2164.1 


W 1 ouo 


b1337 


1961.3 


2585.3 


w1210 


b1239 


1397.2 


1027.0 


w0616 


b0626 


2728.7 


3046.4 


w0548 


b0558 


783.7 


4816.9 


dsrB 




803.9 


1220.8 


w1382 


b1411 


2709.0 


2309.7 


alkB 




1424.0 


1719.3 


nvrF 
yryir 




4511.4 


3724.1 


w1292 


b1321 


12154.3 


8766.6 


w1331 


b1360 


2085.1 


2530.6 




b0788 


2579.2 


2813.6 


mcrA 




2912.9 


3609.3 






2009.3 


4041.7 


w3213 


b3271 


1521.2 


1151.6 




b1024 


2410.5 


1957.8 


ntrR 




583.6 


1710.7 


malY 
II idi i 




3296.2 


3398 9 


yJl in 




632.2 


648.7 


W3013 


b3074 


2711.4 


2653.2 


W3259 


b3324 


840.9 


1191.9 


W0719 


b0736 


2884.4 


2820.3 


W0765 


b0789 


3699.1 
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0.0 
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Table 9 

Most highly expressed genes in Synechocystis sp. PCC6803 in minimal growth media 
5 (BG1 1 + 5mM glucose). 



Systematic 
Name 


Gene 


Function 


Transcript 
copy in total 
mRNA 
(Average 
copy=l) 


slr2051 


cpcG 


. . 

phycobilisome rod-core linker 

polypeptide CpcG 


&A Q1 


S111580 


cpcC 


phycocyanin associated linker protein 


22.71 


slr0447 


amiC 


negative aliphatic amidase regulator 




S111070 


tktA 


transketolase 


19.24 


SU0018 


cbbA 


r* 1 f t • 1 1 • III 

fructose- 1 , 6-bisphosphate aldolase 


14.27 


slrOOl 1 


rbcX 


ND 


1 C\f\ 

12.00 


ssl0563 


psaC 


photosystem I subunit VII 


ii 1 1 
1 1.31 


slrl655 


psaL 


photosystem I subunit XI 


10.91 


sll0819 


psaF 


photosystem I subunit III 


1 A CiC 

10.56 


sill 867 


psbA3 


photosystem 11 Dl protein 


1 A A1 

10.43 


SI11324 


atpF 


ATP synthase subunit b 


10.37 


sill 746 


rpll2 


SOS ribosomal protein LI 2 


1 A 1 *i 

10.13 


slll099 


tufA 


protein synthesis elongation factor Tu 


9.48 


slr0009 


rbcL 


ribulose bisphosphate carboxylase large 
subunit 


8.39 


slr0012 


rbcS 


ribulose bisphosphate carboxylase small 
subunit 


8.14 


slll326 


atpA 


ATP synthase a subunit 


7.72 


slrl908 




ND* 


7.62 


S111578 


cpcA 


phycocyanin a subunit 


7.60 


slr2067 


apcA 


allophycocyanin a chain 


7.51 


i slr2052 




ND* 


7.41 


sill 184 


ho 


heme oxygenase 


7.27 


ssl3437 


rpsl7 


30S ribosomal protein S17 


• 7.26 


S111786 




hypothetical protein (ND*) 


7.16 


ss!0020 


petF 


ferredoxin 


7.07 


S111812 


rps5 


3 OS ribosomal protein S5 


7.04 



* ND = not determined 
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Table 10 

Most highly induced genes in Synechocystis sp. PCC6803 in BG1 1 media containing 5 



Systematic 
Name 


Gene 


Function 


Data 
/control 


STD 


ssr2595 


hliB 


—t : : — t : 

High light- inducible protein 


zz. / 


A 1 


slrl544 




ND* 


i <. < 
O.j 


/.0 


sI10528 




ND* 


IZ.I 


1 Q 


S111514 


hspl7 


small heat shock protein 


y.y 




sir 1687 


nblB 


phycobilisome degradation protein NblB 


O.Z 


1 Q 

i.y 


sill 483 




transforming growth factor induced protein 




Z.Z 


sll2012 


rpoD 


RNA polymerase sigma factor 


o.J 


Z.U 


ssll633 




CAB/ELIP/HLIP superfamily 


6.0 


1 A 

1.0 


ssl2542 


hliA 


high light-inducible protein 


5.o 


1 £ 


SllUo4o 




JNU 


4 7 


0.9 


sir 1674 




ND* 


4.7 


1.8 


sir 1604 


ftsH 


Chloroplast associated protease FtsH 


4.6 


1.9 


slr0320 




ND* 


4.5 


2.2 


S110306 


rpoD 


RNA polymerase sigma factor 


4.4 


1.0 


slr0228 


ftsH 


cell division protein FtsH 


4.3 


1.7 


slrl641 


dpB 


ClpB protein 


4.3 


1.1 


ssr2016 




ND* 


4.2 


2.2 


sill 867 


psbA3 


photosystem II Dl protein 


4.1 


0.3 



* ND = not determined 
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CLAIMS 

What is claimed is: 

1 . A method for identifying gene expression changes within a bacterial 
species comprising: 

5 (a) providing a comprehensive micro-array synthesized from DNA 

comprised in a bacterial species; 

(b) generating a first set of labeled probes from bacterial RNA, the 
RNA isolated from the bacterial species of step (a); 

(c) hybridizing the first set of labeled probes of step (b) to the 
10 comprehensive micro-array of step (a), wherein hybridization 

results in a detectable signal generated from the labeled probe; 

(d) measuring the signal generated by the hybridization of the first 
set of labeled probe to the comprehensive micro-array of 

step (c); 

15 (e) subjecting the bacterial species of step (a) to a gene expression 

altering condition whereby the gene expression profile of the 
bacterial species is altered to produce a modified bacterial 
species ; 

(f) generating a second set of labeled probes from bacterial RNA, 
20 the RNA isolated from the modified bacterial species of step (e); 

(g) hybridizing the second set of labeled probes of step (f) to the 
comprehensive micro-array of step (a), wherein hybridization 
results in a detectable signal generated from the labeled probe; 

(h) measuring the signal generated by the hybridization of the 

25 second set of labeled probes to the comprehensive micro-array 

of step (g); and 

(i) comparing signal generated from the first hybridization to the 
signal generated from the second hybridization to identify gene 
expression changes within a bacterial species. 

30 2. A method for identifying gene expression changes within a bacterial 

species comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
comprised in a bacterial species ; 

(b) generating a first set of fluorescent cDNA from bacterial RNA, 
35 the RNA isolated from the bacterial species of step (a); 

(c) hybridizing the first set of fluorescent cDNA of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 
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results in a detectable signal generated from the fluorescent 
cDNA; 

(d) measuring the signal generated by the hybridization of the first 
set of fluorescent cDNA to the comprehensive micro-array of 

5 step (c); 

(e) subjecting the bacterial species of step (a) to a gene expression 
altering condition whereby the gene expression profile of the 
bacterial species is altered to produce a modified bacterial 
species; 

10 (f) generating a second set of fluorescent cDNA from bacterial 

RNA, the RNA isolated from the modified bacterial species of 
step (e); 

(g) hybridizing the second set of fluorescent cDNA of step (f) to the 
comprehensive micro-array of step (a), wherein hybridization 

15 results in a detectable signal generated from the fluorescent 

cDNA; 

(h) measuring the signal generated by the hybridization of the 
second set of fluorescent cDNA to the comprehensive micro- 
array of step (g); and 

20 (i) comparing signal generated from the first hybridization to the 

signal generated from the second hybridization to identify gene 
expression changes within a bacterial species . 

3. A method according to either Claim 1 or 2 wherein the bacterial 
species is selected from the group consisting of enteric bacteria, Bacillus, 

25 Acinetobacter, Streptomyces, Methylobacter, Pseudomonas, Rhodobacter and 
Synechocystis 

4. A method according to either Claim 1 or 2 wherein the signal 
generating label is selected from the group consisting of fluorescent moieties, 
chemiluminescent moieties, particles, enzymes, radioactive tags. 

30 5. A method according to Claim 4 wherein the signal generating label is 

a fluorescent moiety and is selected from the group consisting of cy3 and cy5. 

6. A method according to either Claim 1 or 2 wherein the comprehensive 
micro-array contains at least 75% of all open reading frames in the bacterial 
species. 

35 7. A method according to Claim 6 wherein the comprehensive micro- 

array contains from about 2000 to about 6000 open reading frames. 

8. A method according to either Claim 1 or 2 wherein the gene 
expression altering condition is selected from the group consisting of a condition 
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altering the genotype of the bacterial species, a condition altering the growth of 
the bacterial species , exposure to mutagens , antibiotics, UV light, gamma-rays, 
x-rays, phage, macrophages, organic chemicals, inorganic chemicals, 
environmental pollutants, heavy metals, changes in temperature, changes in pH, 
5 conditions producing oxidative damage, DNA damage, anaerobiosis, depletion or 
addition of nutrients, addition of a growth inhibitor, and desiccation. 

9. A method for identifying gene expression changes within a genome 
comprising: 

(a) providing a comprehensive micro-array synthesized from DNA 
10 comprised in a prokaryotic or eukaryotic speices; 

(b) generating a control set of fluorescent cDNA from total or 
polyadenylated RNA, the RNA isolated from the species of 
step (a), the fluorescent cDNA comprising at least one first 
fluorescent label and at least one different second fluorescent 

15 label; 

(c) mixing the control set of fluorescent cDNA labeled with the at 
least one first label with the control set of fluorescent cDNA 
labeled with the at least second first label to for a dual labeled 
control cDNA; 

20 (d) hybridizing the dual labeled control set of fluorescent cDNA of 

step (c) to the comprehensive micro-array of step (a), wherein 
hybridization results in a detectable signal generated from the 
fluorescent cDNA; 

(e) measuring the signal generated by the hybridization of the dual 
25 labeled control set of fluorescent cDN A to the comprehensive 

micro-array of step (c); 

(f) subjecting the prokaryote or eukaryote of step (a) to a gene 
expression altering condition whereby the gene expression 
profile of the prokaryote or eukaryote is altered to produce a 

30 modified prokaryote or eukaryote ; 

(g) generating an experimental set of fluorescent cDNA from total 
or polyadenylated RNA, the RNA isolated from the modified 
prokaryote or eukaryote of step (e), the fluorescent cDNA 
comprising the first fluorescent label and the different second 

35 fluorescent label to step (b); 

(h) mixing the experimental set of fluorescent cDN A labeled with 
the at least one first label with the experimental set of 
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fluorescent cDNA labeled with the at least second first label to 
form a dual labeled experimental cDNA; 

(i) hybridizing the experimental set of fluorescent cDNA of 

step (h) to the comprehensive micro-array of step (a), wherein 
5 hybridization results in a detectable signal generated from the 

fluorescent cDNA; 

(j) measuring the signal generated by the hybridization of the 
second set of fluorescent cDNA to the comprehensive micro- 
array of step (g); and 
10 (k) comparing signal generated from the dual labeled control 

hybridization with the dual labeled experimental hybridization 
to identify gene expression changes within a prokaryotic or 
eukaryotic species. 

10. A method according to Claim 9 wherein the first fluorescent label and 
15 the second fluorescent label is independently selected from the group consisting of 

cy3 and cy5. 

11. A method according to Claim 9 wherein the prokaryotic or eukaryotic 
genome is comprised within an organism selected from the group consisting of 
enteric bacteria, Bacillus, Acinetobacter, Streptomyces, Methylobacter, 

20 Pseudomona, cyanobacteria, yeasts, filamentous fungi, plant cells and animal 
cells. 

12. A method according to Claim 1 1 wherein yeast are selected from the 
group consisting of Saccharomyces, Zygosaccharomyces, Kluyveromyces, 
Candida, Hansenula, Debaryomyces, Mucor, Pichia and Torulopsis. 

25 1 3 . A method according to Claim 1 1 wherein cyantobacteria are selected 

from the group consisting of Rhodobacter and Synechocystis. 

14. A method according to Claim 1 1 wherein filamentous fungi are 
selected from the group consisting of Aspergillus and Arthrobotrys. 

15. A method for quantitating the amount of protein specifying RNA 
30 contained within a genome comprising: 

(a) providing a comprehensive micro-array comprising a 
multiplicity of open reading frames synthesized from genomic 
DNA comprised in a prokaryotic or eukaryotic organism; 

(b) generating a set of fluorescent cDN A from total or poly- 

35 adenylated RNA isolated from the prokaryotic or eukaryotic 

organism of step (a); 

(c) generating a set of fluorescent DNA from genomic DNA 
isolated from the prokaryotic or eukaryotic organism of step (a); 
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(d) hybridizing the fluorescent cDNA of step (b) to the 

comprehensive micro-array of step (a), wherein hybridization 
results in a first fluorescent signal generated from the fluorescent 
cDNA for each open reading frame; 
5 (e) hybridizing the fluorescent DNA of step (c) to the 

comprehensive micro-array of step (a), wherein hybridization 
results in a second fluorescent signal generated from the 
fluorescent DNA for each open reading frame; and 

(f) dividing, for each open reading from , the first fluorescent signal 
10 into the second fluorescent signal to provide a quantitated 

measure of the amount of protein specifying RNA for each open 
reading frame. 

1 6. A method for quantitating the amount of protein specifying RNA 
contained within a genome comprising: 

15 (a) providing a comprehensive micro-array comprising a 

multiplicity of genes synthesized from genomic DNA 
comprised in a prokaryotic or eukaryotic organism; 

(b) generating a set of fluorescent cDNA from total or poly- 
adenylated RNA isolated from the prokaryotic or eukaryotic 

20 organism of step (a); 

(c) generating a set of fluorescent DNA from genomic DNA 
isolated from the prokaryotic or eukaryotic organism of step (a); 

(d) hybridizing the fluorescent cDNA of step (b) to the 
comprehensive micro-array of step (a), wherein hybridization 

25 results in a first fluorescent signal generated from the 

fluorescent cDNA for each gene; 

(e) hybridizing the fluorescent DNA of step (c) to the 
comprehensive micro-array of step (a), wherein hybridization 
results in a second fluorescent signal generated from the 

30 fluorescent DNA for each gene; and 

(f) dividing, for each open reading from , the first fluorescent signal 
into the second fluorescent signal to provide a quantitated 
measure of the amount of protein specifying RNA for each gene. 

1 7. A method for identifying gene expression changes within a bacterial 
35 species according to either Claim 1 or 2 providing for quantitating the amount of 

protein specifying RNA contained within a genome according to either Claim 15 
or 16. 
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1 8. A method for identifying gene expression changes within a genome 
according to Claim 8 providing for quantitating the amount of protein specifying 
RNA contained within a genome according to Claim 15 or 16. 
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